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) Title: PRODUCTION OF ERYTHROPOIETIN 



Abstract 

Novel polypeptides possessing pan or all of the primary 
ciural conformation and one or more of the biological pro- 
ties of mammalian erythropoietin ('EPO') which are charac- 
zed in preferred forms by being the product of procaryotic 
ucaryoiic host expression of an exogenous DNA sequence, 
straii'vely, genomic DNA, cDNA and manufactured DNA 
uences coding for part or all of the sequence of amino acid 
dues of EPO or for analogs thereof are incorporated into 
3nomously replicating plasmid or viral vectors employed to 
isform or transfect suitable procaryotic or eucaryotic host 
s such as bacteria, yeast or venebraie cells in culture. Upon 
ation from culture media or cellular lysates or fragments, 
ducts of expression of the DNA sequences display, e.g., the 
nunological properties and in vitro and in vivo biological ac- 
ties of EPO of human or monkey species origins. Disclosed 
) are chemically synthesized polypeptides sharing the bio- 
mical and immunological properties of EPO. Also disclosed 
improved methods for the detection of specific single 
nded polynucleotides in a heterologous cellular or viral 
tple prepared from^e.g., DNA present in a plasmid or viral- 
ne cDNA or genomic DNA 'library'. 
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"PRCCUCTICN OF ERYTHROPOIETIN" 

This is a continuation-in-part of my co-pending 
U.S. Patent Application Serial Nos. 561,024, filed 
5 December 13, 1983, 582,135, filed February 21, 1984, and 
655,341, filed September 23, 1984. 

BACKGROUND 

10 The present invention relates generally to the 

manipulation of genetic materials and, more particularly, 
to recombinant procedures making possible the production 
of polypeptides possessing part or all of the primary 
structural conformation and/or one or more of the biolo- 

15 gical properties of naturally-occurring erythropoietin. 

A, Manipulation Of Genetic Materials 

Genetic materials may be broadly defined as 
those chemical substances which program for and guide the 

20 manufacture of constituents of cells and viruses and 

direct the responses of cells and viruses. A long chain 
polymeric substance known as deoxyribonucleic acid (DNA) 
comprises the genetic material of all living cells and 
viruses except for certain viruses which are programmed 

25 by ribonucleic acids (RNA). The repeating units in DNA 
polymers are four different nucleotides, each of which 
consists of either a purine (adenine or guanine) or a 
pyrimidine (thymine or cytosine) bound to a deoxyribose 
sugar to which a phosphate group is attached. Attachment 

30 of nucleotides in linear polymeric form is by means of 
fusion of the 5' phosphate of one nucleotide to the 3' 
hydroxyl group of another. Functional DNA occurs in the 
form of stable double stranded associations of single 
strands of nucleotides (known as deoxyoligonucleo tides ) , 
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which associations occur by means of hydrogen bonding 
between purine and oyrimidine bases [i.e., 
"ccmplementary" associations existing either between ade- 
nine (A) and thymine (T) or guanine (G) and cytosine 

5 CC)]. ay convention, nucleotides are referred to by the 
names of their constituent purine or pyrimidine bases, 
and the complementary associations of nucleotides in 
double stranded DNA (i.e., A-T and G-C) are referred to 
as "base pairs". Ribonucleic acid is a polynucleotide 

10 comprising adenine, guanine, cytosine and uracil (U), 
rather than thymine, bound to ribcse and a phosphate 
group. 

Most briefly put, the progra'.^ming function of 
DNA is generally effected through a process wherein spe- 
15 cific DNA nucleotide sequences Cgenes) are "transcribed" 
into relatively unstable messenger RNA (mRNA) polymers. 
The mRNA, in turn, serves as a template for the formation 
of structural, regulatory and catalytic proteins from 
amino acids. This mRNA "translation" process involves 
20 the operations of small RNA strands (tRNA) which 

transport and align individual amino acids along the mRNA 
strand to allow for formation of polypeptides in proper 
amino acid sequences. The mRNA "message", derived from 
DNA and providing the basis for the tRNA supply and 
25 orientation of any given one of the twenty amino acids 
for. polypeptide "expression", is in the form of triplet 
"codons" -- sequential groupings of three nucleotide . 
bases. In one sense, the formation of a protein is the 
ultimate form of "expression" of the programmed genetic 
30 message provided by the nucleotide sequence of a gene. 

"Promoter" DNA sequences usually "precede" a 
gene in a DNA polymer and provide a site for initiation 
of the transcription into mRNA. "Regulator" DNA sequen- 
ces, also usually "upstream" of (i.e., preceding) a gene 
35 in 1 given DNA polymer, bind proteins that determine the 
frequency (or rate) of transcriptional initiation. 
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10 



Collectively referrsd to as " promotsr/regulator " or 
"control" ONA sequence, these sequences which precede a 
selected gene (or series of genes) in a functional ONA 
polytner cooperate to determine whether the transcription 
(and eventual expression) of a gene will occur. ONA 
sequences which "follow" a gene in a ONA polymer and pro- 
vide a signal ^for termination of the transcription into 
mRNA are referred to as transcription "terminator" 
sequences . 

A focus of microbiological processing for the 
last decade has been the attempt to manufacture 
industrially and pharmaceutically significant substances 
using organisms which either do not initially have gene- 
tically coded information concerning the desired product 
15 included in their ONA, or (in the case of mammalian cells 
in culture) do not ordinarily express a chromosomal gene 
at appreciable levels. Simply put, a gene that specifies 
the structure of a desired polypeptide product is either 
isolated from a "donor" organism or chemically synthe- 
2C sizea and then stably introduced into another organism 
which is preferably a sel f -replicat ing unicellular orga- 
nism such as bacteria, yeast or mammalian cells in 
culture. Once this is done, the existing machinery for 
gene expression in the "transformed" or " trans fected" 
25 microbial host cells operates to construct the desired 
product, using the exogenous- ONA as a template for 
transcription of mRNA which is then translated into a 
continuous sequence of amino acid residues. 

The art is rich in patent and literature publi- 
30 cations relating to "recombinant ONA" methodologies for 
the isolation, synthesis, purification and amplification 
of genetic materials for use in the transformation of 
selected host organisms. U.S. Letters Patent 
No. 4,237,224 to Cohen, et al., for example, relates to 
35 transformation of unicellular host organisms with 

"hybrid" viral or circular plasmid ONA which includes 
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selected exogenous DNA sequences. The procedures of the 
Cohen, et 3l. patent first involve manufacture of a 
transformation vector by enzymatically cleaving viral or 
circular plasmid DNA to form linear DNA strands. 
5 Selected foreign ("exogenous" or "heterologous") DNA 
strands usually including sequences coding for desired 
product are prepared in linear form through use of simi- 
lar enzymes. The linear viral or plasmid DNA is incu- 
bated with the foreign DNA in the presence of ligating 

10 enzymes capable of effecting a restoration process and 
"hybrid" vectors are formed which include the selected 
exogenous DNA segment "spliced" into the viral or cir- 
cular DNA plasmid. " 

Transformation of compatible unicellular host 

15 organisms with the hybrid vector results in the formation 
of multiple copies of the exogenous DNA in the host cell 
population. In some instances, the desired result is 
simply the amplification of the foreign DNA and the 
"product" harvested is DNA. Mere frequently, the goal of 

20 transformation is the expression by the host cells of the 
exogenous DNA in the form of large scale synthesis of 
isolatable quantities of commercially significant protein 
or polypeptide fragments coded for by the foreign DNA. 
See also, e.g., U.S. Letters Patent Nos. 4,264,731 (to 

25 Shine), 4,273,875 (to Manis), 4,293,652 (to Cohen), and 
European Patent Application 093,619, published November 
9, 1983. 

The development of specific DNA sequence.s for 
splicing into DNA vectors is accomplished by a variety of 

30 techniques, depending to a great deal on the degree of 
" foreignness" of the "donor" to the projected host and 
the size of the polypeptide to be expressed in the host. 
At the risk of over-simplif icat ion , it can be stated that 
three alternative principal methods can be employed: (l) 

35 the "isolation" of double-stranded DNA sequence from the 
genomic ONA of the donor; (2) the chemical manufacture of 
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a DNA saauence providing a code for a polypeptide of 
interest; and C3) the Lr^ vitro synthesis of a double- 
stranded DNA sequence by enzymatic ''reverse transcrip- 
tion" of mRNA isolated from donor cells. The 
5 last-mentioned methods which involve formation of a ONA 
"tomplement*' of mRNA are generally referred to as "cDNA" 
methods* 

Manufacture of DNA sequences is frequently the 
method of choice when the entire sequence of amino acid 

10 residues of the desired polypeptide product is known. 
DNA manufacturing procedures of co-owned, co-pending 
U.S. Patent Application Serial No. 483,451, by Alton, et 
al., (filed April 15, 1983 and corresponding to PCT 
US83/00605, published November 24, 1983 as W083/04053 ) , 

15 for example, provide a superior means for accomplishing 
such highly desirable results as: providing for the pre-, 
sence of alternate codons commonly found in genes which 
are highly expressed in the host organism selected for 
expression (e.g., providing yeast or E . col i "preference" 

2G codons); avoiding the presence of untranslated "intron*' 
sequences (commonly present in mammalian genomic ONA 
sequences and mRNA transcripts thereof) which are not 
readily processed by procaryotic host cells; avoiding 
expression of undesired "leader" polypeptide sequences 

25 commonly coded for by genomic DNA and cDNA sequences but 
frequently not readily cleaved from the polypeptide of 
interest by bacterial or yeast host cells; providing for 
ready insertion of the ONA in convenient expression vec- 
tors in association with desired promoter/regulator and 

30 terminator sequences; and providing for ready construc- 
tion of genes coding for polypeptide fragments and ana- 
logs of the desired polypeptides. 

When the entire sequence of amino acid residues 
of the desired polypeptide is not known, direct manufac- 

35 ture of DNA sequences is not possible and isolation of 

DNA sequences coding for the polypeptide by a cONA method 
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becomes the method of choice despite the potential 
drawbacks in ease of assembly of expression vectors 
capable of providing high levels of microbial expression 
referred to above. Among the standard procedures for 
5 isolating cONA sequences of interest is the preparation 
of plasmid-borne cDNA "libraries" derived from reverse 
transcription of mRNA abundant in donor cells selected as 
responsible for high level expression of genes (e,g., 
libraries of cDNA derived from pituitary cells which 
10 express relatively large quantities of growth hormone 
products). Where substantial portions of the polypep- 
tide's amino acid sequence are known, labelled, single- 
stranded DNA probe sequences duplicating a sequence 
putatively present in the "target" cDNA may be employed 
15 in DNA/DNA hybridization procedures carried out on cloned 
copies of the cONA which have been denatured to single 
stranded form. [See, generally, the disclosure and 
discussions of the art provider in U.S. Patent No. 
4,39A,4A3 to Weissman, et al.' and the recent demonstra- 
20 tions of the use of long oligonucleotide hybridization 
probes reported in Wallace, et al., Nuc .Acids Res. , 6, 
pp. 3543-3557 (1979), and Reyes, et al., P.N.A.S. 
(U,S.A. ) , 79, pp. 3270-3274 (1982), and Jaye, et al., 
Nuc. Acids Res. , U, pp. 2325-2335 (1983). See also, U.S. 
25 Patent No. 4,358,535 to Falkow, et al., relating to 

DNA/DNA hybridization procedures in effecting diagnosis; 
published European Patent Application Nos. 0070685 and 
0070687 relating to light-emitting labels on single 
stranded polynucleotide probes; Davis, et al., "A Manual 
30 for Genetic Engineering, Advanced Bacterial Genetics", 
Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. 
(1980) at pp. 55-58 and 174-176, relating to colony and 
plaque hybridization techniques; and. New England Nuclear 
(Boston, Mass.) brochures for "Gene Screen" Hybridization 
35 Transfer Membrane materials providing instruction manuals 
for the transfer and hybridization of DNA and RNA, 
Catalog No. NEF.972.] 
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Among the more signficant recent advances in 
hytir idizat ion procedures for the screening of reccmoinant 
clones is the use of labelled mixed synthetic oligo- 
nucleotide probes, each of which is potentially the 
5 complete complement of a specific ONA sequence in the 
hybridization sample including a heterogenous mixture of 
single stranded DNAs or RNAs. These procedures are 
acknowledged to be especially useful in the detection of 
cDNA clones derived from sources which provide extremely 

10 low amounts of mRNA sequences for the polypeptide of 
interest. Briefly put, use of stringent hybridization 
conditions directed toward avoidance of non-specific 
binding can allow, e.g., for the autoradiographic 
visualization of a specific cDNA clone upon the event of 

15 hybridization of the target DNA to that single probe 

within the mixture which is its complete complement. See- 
generally, Wallace, et al., Nuc .Acids Res . , 9, pp. 
379-397 (1931); Suggs, et al. P.N.A.S. [U.S.A. ) , 73, pp. 
6613-6617 (1981); Choo, et al., Nature, 299 , pp. 178-180 

20 ( 1552); Kurachi, et al., P.N.A.S. (U.S.A. ) , 79, 

pp. 6A61-6464 (1982); Ghkubo, et al., P.N.A.S. (U.S.A. ) , 
30, pp. 2196-2200 (1983); and Kornblihtt, et al. 
P.N.A.S. (U.S.A.) , 80, pp. 3218-3222 (1983). In general, 
the mixed probe procedures of Wallace, et al. (1981), 

25 supra , have been expanded upon by various workers to the 
point where reliable results, have reportedly been 
obtained in a cONA clone isolation using a 32 member 
mixed "pool" of 16-base-long (16-mer) oligonucleotide 
probes of uniformly, varying DNA sequences together with 

30 a single U-mer to effect a two-site "positive" confir- 
mation of the presence of cONA of interest. See, 
Singer-Sam, et al., P.N.A.S. (U.S.A.) , 80, pp. 302-806 
(1983). 

The use of genomic DNA isolates is the least 
35 common of the three above-noted methods for developing 



wo 85/02610 




VUS84/0202i 



- 8 - 

specific DNA sequences for use in recombinant procedures. 
This is especially true in the area of recombinant proce- 
dures directed to securing microbial expression of mam- 
malian polypeptides and is due, principally to the 
5 complexity of mammalian genomic DNA. Thus, while 
reliable procedures exist for developing phage-borne 
libraries of genomic DNA of human and other mammalian 
species origins [See, e.g., Lawn, et al. Cell, 15 , 
pp. 1157-1174 (1978) relating to procedures for 

10 generating a human genomic library commonly referred to 
as the **Maniatis Library"; Karn, et al., P.N .A.S. 
(U,S.A, ) , 77, pp. 5172-5176 (1980) relating to a human 
genomic library based on alternatlve^restr iction endo- 
nuclease fragmentation procedure; and Blattner, et al., 

15 Science, 196 , pp. 161-169 (1977) describing construction 
of a bovine genomic library] there have been relatively 
few successful attempts at use of hybridization proce- 
dures in isolating genomic DNA__.n the absence of exten- 
sive foreknowledge of amino aci: or DNA sequences. As 

20 one example, Fiddes, et al., J.Mol. and" Apo .Genetics , 1^, 
pp. 3-18 (1981) report the successful isolation of a gene 
coding for the alpha subunit of the human pituitary gly- 
coprotein hormones from the Maniatis Library through use 
of a "full length" probe including a complete 621 base 

25 pair fragment of a previously-isolated cDNA sequence for 
the alpha subunit. As another example, Das, et al., 
P.N.A.S. (U.S.A. ) , 80, pp. 1531-1535 (1983) report isola- 
tion of human genomic clones for human HLA-DR using a 175 
base pair synthetic oligonucleotide. Finally, Anderson, 

30 et al., P.N.A.5. (U.S.A.), 80, pp. 6338-6342 (1983) 
report the isolation of genomic clone for bovine 
pancreatic trypsin inhibitor (BPTI) using a single probe 
86 base pairs in length and constructed according to the 
known amino acid sequence of BPTI. The authors note a 

35 determination of poor prospects f or ' isolat ing mRNA 

suitable for synthesis of a cDNA library due to apparent 
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lew levels of mRNA in initially targeted parotid gland 
and lung tissue sources and then address the prospects of 
success in probing a genomic library using a mixture of 
labelled probes, stating: "More generally, mixed- 
5 sequence ol igodeoxynucleo tide probes have been used to 
isolate protein genes of unknown sequence from cDNA 
libraries. Such probes are typically mixtures of 8-32 
oligonucleotides, 14-17 nucleotides in length, repre- 
senting every possible codon combination for a small 

10 stretch (5-6 residues) of amino acid sequence. Under 
stringent hybridization conditions that discriminate 
against incorrectly base-paired probes, these mixtures 
are capable of locating specific gene sequences in clone 
libraries of low-to-moderate complexity. Ne vertheles*s , 

15 because of. their short length and heterogeneity, mixed 
probes often lack the specificity required for probing 
sequences as complex as a mammalian genome. This makes 
such a method impractical for the isolation of mammalian 
protein genes when the corresponding mRNAs are 

20 unavailable." (Citations omitted]. 

There thus continues to exist a need in the art 
for improved methods for effecting the rapid and effi- 
cient isolation of cONA clones in instances where little 
is known of the amino acid sequence of the polypeptide 

25 coded for and where "enriched" tissue sources of mRNA are 
not readily available for use in constructing cDNA 
libraries. Such improved methods would be especially 
useful if they were applicable to isolating mammalian 
genomic clones where sparse information is available con- 

30 cerning amino acid sequences of the polypeptide coded for 
by the gene sought. 



B. Erythropoietin As A Polypeptide Of Interest 

Ery thropoiesis , the production of red blood 
35 cells, occurs continuously throughout the human life span 
to offset cell destruction. Ery thropoiesis is a very 
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precisely controlled physiological mechanism enabling 
sufficient numbers of red blood cells to be available in 
the blood for proper tissue oxygenation, but not so many 
that the cells would impede circulation. The formation 
5 of ..red blood cells occurs in the bone marrow and is under 
the control of the hormon e . erythropoietin. 

Erythropoietin, an acidic glycoprotein of 
approximately 34,000 dalton molecular weight, may occur 
in 'three forms: a, 3 and asialo. The a and B forms 

10 differ slightly in carbohydrate components, but have the 
same potency, biological activity and molecular weight. 
The asialo form is an a or B form with the terminal car- 
bohydrate (sialic acid) removed. Erythropoietin is pre- 
sent in very low concentrations in plasma when the body 

15 is in a healthy state wherein tissues receive sufficient 
oxygenation from the existing number. of erythrocytes. 
This normal low concentration is enough to stimulate 
replacement of red blood cells -vhich are lost normally 
through aging. 

20 The amount of erythropoietin in the circulation 

is increased under conditions of hypoxia when oxygen 
transport by blood cells in the circulation is reduced. 
Hypoxia may be caused by loss of large amounts of blood 
through hemorrhage, destruction of red blood cells by 

25 over-exposure to radiation, reduction in oxygen intake 
due to high altitudes or prolonged unconsciousness, or 
various forms of anemia. In response to tissues 
undergoing hypoxic stress, erythropoietin will increase 
red blood cell production by stimulating the conversion 

30 of primitive precursor cells in the bone marrow into pro- 
erythroblasts which subsequently mature, synthesize 
hemoglobin and are released into the circulation as red 
blood cells. When the number of red blood cells in cir- 
culation is greater than needed for normal tissue oxygen 

35 requirements, erythropoietin in circulation is decreased 
See generally. Testa, et al., Exp.Hematol . , 
BCSupD. '8), 144-152 (1980); Tong, et al., J.Biol.Chem. , 
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256( 2^) , 12666-12672 (1981 ); Goldwasser, J . Ce 1 1 . Phy s io 1 . , 
llG(5ucD. 1 ) , 133-135 ( 1982); Finch, Blood , 60( 6) , 
12^1-12^6 ( 1932 ); Sytowski, et al., Expt ,Hematol . , 8( Suop 
32, 52-64 ( 1980: Naughton, Ann .Clin , Lab . Sci . , 13(5) , 
5 432-438 (1983); Weiss, et al., Am, J .Vet .Res . , 

44(10) ,1832-1835 (1983); Lappin, et al., Exp .Hematol . , 
11(7) , 661-666 ( 1983); Baciu, et al . , Ann . N . Y . Acad . Sc i . , 
. 414 , 66-72 (1983); Murphy, et al . , Ac ta , Haematologica 
Jaconica , 46( 7) , 1380-1396 ( 1983); Oessypris, et al . , 

10 3rit .J. Haematol . , 56 , 295-306 (1984); and, Emmanouel, et 
al., Am. J.Physiol . , 247 (1 Pt 2) , F168-76 (1984). 

Because erythropoietin is essential in the pro- 
cess of red blood cell formation, the hormone has poten- 
tial useful application in both the diagnosis and the 

15 treatment of blood disorders characterized by low or 
defective red blood cell production. See, generally, 
Pennathur-Das, et al . , Blood , 63(5) , 1168-71 (1984) and 
Haddy, Am. Jour . Ped . Hema to 1 . /Oncol . , 4 , 191-196, (1982 ) 
relating to erythropoietin in possible therapies for 

20 sickle cell disease, and Eschbach, et al . J.Clin. Invest , , 
74( 2 ) , pp. 434-441, (1984), describing a therapeutic 
regimen for uremic sheep based on ijn vivo response to 
erythropoiet in-rich plasma infusions and proposing a 
dosage of 10 U EPO/kg per day for 15-40 days as correc- 

25 tive of anemia of the type associated with chronic renal 
failure* See also, Krane, Henry Ford Hoso.Med.J. , 31(3 ) , 
177-181 (1983). 

It has recently been estimated that the availa- 
bility of erythropoietin in quantity would allow for 

30 treatment each year of anemias of 1,600,000 persons in 
the United States alone. See, e.g., Morrison, 
"Bioprocessing in Space an Overview", pp. 557-571 in 
The World Biotech Report 1984, Volume 2:USA, (Online 
Publications, New York, N.Y. 198^). Recent studies have 

35 provided a basis for projection of efficacy of erythro- 
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poietin therapy in a variety of disease states, disorders 
and states of hematologic irregularity: Vedovato, et 
al., Acta .Haematol , 71, 211-213 (1984) 
( beta-thalassemia ) ; Vichinsky, et al., J .Pediatr . , 

5 105(1) . 15-21 (1984) (cystic fibrosis); Cotes, et al., 
Brit. J.Obstet.Gyneacol. . 90(4) . 304-311 (1983) 
(pregnancy, menstrual disorders); Haga, et al . , 
Acta. Pediatr .Scand. . 72, 827-831 (1983) (early anemia of 
prematurity); Glaus-Walker, et al., 

10 Arch.Phys .Med .Rehabil ♦ , 65, 370-374 (1984) (spinal cord 
injury); Dunn, et al., Eur . J .Appl . Physiol . , 52 , 178-182 
(1984) (space flight); Miller, et al. . Brit . J .Haematol . , 
52, 545-590 (1982) (acute blood "Tos^); Udupa, et al., 
J. Lab. Clin. Med. , 103(4) . 574-580 and 531-588 (1984); and 

15 Lipschitz, etal.. Blood , 63(3) . 502-509 (1983) (aging); 
and Dainiak, et al., Cancer . 51( 6) , 1101-1106 (1983) and 
Schwartz, et al . , Otolaryngol. . 109 , 269-272 (1983) 
(various neoplastic disease ststes accompanied by abnor- 
mal erythropoiesis) . 

20 Prior attempts to obtain erythropoietin in good 

yield from plasma or urine have proven relatively unsuc- 
cessful. Complicated and sophisticated laboratory tech- 
niques are necessary and generally result in the 
collection of very small amounts of impure and unstable 

25 extracts containing erythropoietin. 

U.S. Letters Patent No. 3,033,753 describes a 
method for partially purifying erythropoietin from sheep 
blood plasma which provides low yields of a crude, solid 
extract containing erythropoietin. 

30 Initial attempts to isolate erythropoietin from 

urine yielded unstable, biologically inactive prepara- 
tions of the hormone. U.S. Letters Patent No. 3,865,801 
describes a method of stabilizing the biological activity 
of a crude substance containing erythropoietin recovered 

35 from urine. The resulting crude preparation containing 
erythropoietin purportedly retains 90* of erythropoietin 
activity, and is stable. 
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Another method of purifying human erythropoietin 
from urine of patients with aplastic anemia is described 
in Miyaks, et al., J .Biol .Chem . , Vol. 252, No. 15 (August 
10, 1977), pp. 5553-5564. This seven-step procedure 
5 includes ion exchange chromatography, ethanol precipita- 
tion, gel filtration, and adsorption chromatography, and 
yields a pure erythropoietin preparation with a potency 
of 70,400 units/mg of protein in 21% yield. 

U.S. Letters Patent No. 4,397,840 to Takezawa, 

10 et al. describes methods for preparing "an erythropoietin 
product" from healthy human urine specimens with weakly 
basic ion exchangers and proposes that the low molecular 
weight products obtained "have no inhibitory effects 
against erythropoietin. 

15 U.K. Patent Application No. 2,085,887 by 

Sugimoto, et al . , published May 6, 1982, describes a pro-- 
cess for the production of hybrid human lymphoblastoid 
cells, reporting production levels ranging from 3 to 420 
Units of erythropoietin per ml of suspension of cells 

20 (distributed into the cultures after mammalian host propaga 
tion containing up to 10^ cells per ml. At the highest pro 
duction levels asserted to have been obtained, the rate 
of erythropoietin production could be calculated to be 
from 40 to about 4,000 Units/10^ cells/48 hours in in 

25 vitro culture following transfer of cells from iri vivo 
propagation systems. (See also the equivalent U.S. 
Letters Patent No. 4,377,513.) Numerous proposals have 
been made for isolation of erythropoietin from tissue 
sources, including neoplastic cells, but the yields have 

30 been quite low. See, e.g., Jelkman, et al., 

Expt .Hematol . , 11( 7) , 581-588 (1983); Tambourin, et al., 
P.N,A.S. (U.S.A.) , 80, 6269-6273 (1983); Katsuoka, et 
al., Gann , 74 , 534-541 (1983); Hagiwara, et al . , Blood , 
63(4) , 828-835 (1984); and Choppin, et al . , Blood , 64(2) , 

35 341-347 (1934 ) . 

Other isolation techniques utilized to obtain 
purified erythropoietin involve immunological procedures. 
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A polyclonal, serum-derived antibody directed against 
erythropoietin is developed by injecting an animal, pre- 
ferably a rat or rabbit, with human erythropoietin. The 
injected human erythropoietin is recognized as a foreign 
5 antigenic substance by the immune system of the animal 
and elicits production of antibodies against the antigen. 
Differing cells responding to stimulation by the antige- 
nic substance produce and release into circulation anti- 
bodies slightly different from those produced by other 

10 responding cells* The antibody activity remains in the 
serum of the animal when its blood is extracted. While 
unpurified serum or antibody preparations purified as a 
serum immunoglobulin G fraction may^'dnen be used in 
assays to detect and complex with human erythropoietin,. 

15 the materials suffer from a major disadvantage. This 

serum antibody, composed of all the different antibodies 
produced by individual cells, is polyclonal in nature and 
will complex with components in crude extracts other than 
erythropoietin alone. 

20 Of interest to the background of the present 

invention are recent advances in the art of developing 
continuous cultures of cells capable of producing a 
single species of antibody which is specifically immuno- 
logically reactive with a single antigenic determinant of 

25 a selected antigen. See, generally, Chisholm, High 

Technology , Vol. 3, No. 1, 57-63 (1983). Attempts have 
been made to employ cell fusion and hybridization tech- 
niques to develop "monoclonal" antibodies to erythro- 
poietin and to employ these antibodies in the isolation 

30 and quantitative detection of human erythropoietin. As 
one example, a report of the successful development of 
mouse-mouse hybridoma cell lines secreting monoclonal 
antibodies to human erythropoietin appeared in abstract 
form in Lee-Huang, Abstract No. 1463 of Fed.Proc. , 41, 

35 520 (1982). As another example, a detailed description 
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of the preparation and use of a monoclonal, antl- 
er ythropoietin antibody appears in Weiss, et al., 
P.N,A>5. (U.S.AJ , 79, 5^65-5469 (1982). See also, 
Sasaki, Biomed > Siochim . Acta . , 42( 11/12) , S202-S206 
5 (1983); Yanagawa, et al . , Blood , 64(2) , 357-364' ( 1934 ) ; 
Yanagawa, et al., J . Biol .Chem > , 259(5) , 2707-2710 (1984); 
and U.S. Letters Patent No. 4,465,624. 

Also of interest to the background of the inven- 
tion are reports of the immunological activity of synthe- 

10 tic peptides which substantially duplicate the amino acid 
sequence extant in naturally-occurring proteins, 
glycoproteins and nucleoproteins • More specifically, 
relatively low molecular weight polypeptides have been 
shown to participate in immune reactions which are simi- 

15 lar in duration and extent to the immune reactions of 

physiologically significant proteins such as viral anti- 
gens, polypeptide hormones, and the like. Included among 
the immune reactions of such polypeptides is the provoca- 
tion of the formation of specific antibodies in 

20 immunologically active animals. See, e.g.,- Lerner, et . 
al., Cell , 23, 309-310 (1981); Ross, et al . , Nature , 294, 
654-656 (1981); Walter, et al., P.N.A.S. (U.S.A.) , 77, 
5197-5200 (1980); Lerner, et al., P.N.A.S. (U.S.A . ) , 78, 
3403-3407 (1981 ); Walter, et al., P .N , A . S . ( U ■ S . A , ) , 28, 

25 4882-4886 (1981 ); Wong, et al . , P.N .A.S. (U.S.A. ) , 78, 

7412-7416 (1981); Green, et-al. Cell . 28, 477-487 (1982); 
Nigg, et al., P.N. A.S. (U.S.A.) , 79, 5322-5326 ( 1982 ); 
Baron, et al., Cell , 28, 395-404 (1982); Dreesman, et 
al., Nature , 295, 158-160 (1982); and Lerner, Scientific 

30 American , 248, No. 2, 66-74 (1983). See, also, Kaiser, 
et al., Science , 223, pp. 249-255 (1984) relating to 
biological and immunological activities of synthetic pep- 
tides which approximately share secondary structures of 
peptide hormones but may not share their primary struc- 

35 tural conformation. The above studies relate, of course, 
to amino acid sequences of proteins other than erythro- 
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poietin, a substance for which no substantial amino acid 
sequence information has been published. In co-owned, 
co-pending U.S. Patent Application Serial No. 463,724, 
filed February 4, 1983, by J. Egrie, published August 22, 
5 1984 as European Patent Application No. 0 116 446, there 
is described a mouse-mouse hybridoma cell line 
(A.T.C.C. No. HB8209) which produces a highly specific 
monoclonal, anti-erythropoietin antibody which is also 
specifically immunoreactive with a polypeptide comprising 
10 the following sequence of amino acids: 

NHj-Ala-Pro-Pro-Arg-Leu-Ile-Cys-Asp-Ser-Arg-Val-Leu- 
Glu-Arg-Tyr-Leu-Leu-Glu-Ala-Lys-COOH . 
The polypeptide sequence is one as^sTgned to the first 
twenty amino acid residues of mature human erythropoietin 
15 isolated according to the method of Miyake, et al., 

J.Biol .Chem. , 252 , 5558-5564 (1977) and upon which amino 
acid analysis was performed by the gas phase sequencer 
(Applied aiosystems, Inc.) according to the procedure of 
Hewick, M., et al., J. Biol .Cher . , 256 , 7990-7997 (1981). 
20 See, also, Sue, et al., Proc. Nat. Acad. Sci. (USA) , 80, 
pp. 3651-3655 (1983) relating to development of polyclo- 
nal antibodies against a synthetic 26-mer based on a dif- 
fering amino acid sequence, and Sytowski, et al., 
J. Immunol. Methods , 69, pp. 181-186 (1984). 
25 While polyclonal and monoclonal antibodies as 

described above provide highly useful materials for use 
in immunoassays for detection and quantification of 
erythropoietin and can be useful in the affinity .purifi- 
cation of erythropoietin, it appears unlikely that these 
30 materials can readily provide for the large scale isola- 
tion of quantities of erythropoietin from mammalian sour- 
ces sufficient for further analysis, clinical testing and 
potential wide-ranging therapeutic use of the substance 
in treatment of, e.g., chronic kidney disease wherein 
35 diseased tissues fail to sustain production of erythro- 
poietin. It is consequently projected in the art that 
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the best prospects for fully characterizing mammalian 
erythropoietin and providing large quantities of it for 
potential diagnostic and clinical use involve successful 
application of recombinant procedures to effect large 
5 scale microbial synthesis of the compound. 

While substantial efforts appear to have been 
made in attempted isolation of DNA sequences coding for 
human and other mammalian species erythropoietin, none 
appear to have been successful. This is due principally 

10 to the scarcity of tissue sources, especially human 

tissue sources, enriched in mRNA such as would allow for 
construction of a cDNA library from which a ONA sequence 
coding for erythropoietin might be isolated by conven- 
tional techniques. Further, so little is known of the 

15 continuous sequence of amino acid residues of erythro- 
poietin that it is not possible to construct, e.g., long 
polynucleotide probes readily capable of reliable use in 
ONA/DNA hybridization screening of cDNA and especially 
genomic ONA libraries. Illustratively, the twenty amino 

20 acid sequence employed to generate the above-named 

monoclonal antibody produced by A.T.C.C. No. HB8209 does 
not admit to the construction of an unambiguous, 60 base 
oligonucleotide probe in the manner described by 
Anderson, et al., supra . It is estimated that the human 

25 gene for erythropoietin may appear as a "single copy 
gene" within the human genome and, in any event, the 
genetic material coding for human erythropoietin is 
likely to constitute less than 0.00005% of total human 
genomic DNA which would be present in a genomic library. 

30 To date, the most successful of known reported 

attempts at recombinant-related methods to provide DNA 
sequences suitable for use in microbial expression of 
isolatable quantities of mammalian erythropoietin have 
fallen far short of the goal. As an example, Farber, et 

35 al. ExD.Hematol . , 11^. Supp. U, Abstract 101 (1983) 
report the extraction of mRNA from kidney tissues of 
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phenylhydrazine-treated baboons and the injection of the 
mRNA into Xenoous laevis oocytes with the rather tran- 
sitory result of in vitro production of a mixture of 
"translation products" which included among them 
5 displaying biological properties of erythropoietin. More 
recently, Farber, et al . , Blood , 62, No. 5, Supp. No. 1, 
Abstract 392, at page 122a (1983) reported the in vitro 
translation of human kidney mRNA by frog oocytes. The 
resultant translation product mixture was estimated to 

10 include on the order of 220 mU of a translation product 
having the activity of erythropoietin per microgram of 
injected mRNA. While such levels of in vitro translation 
of exogenous mRNA coding for erythropoietin were 
acknowledged to be quite low (compared even to the prior 

15 reported levels of baboon mRNA translation into the 

sought-for product) it was held that, the results confirm 
the human kidney as a site of erythropoietin expression, 
allowing for the construction cf an enriched human kidney 
cDNA library from which the desired gene might be iso- 

20 lated, Csee also, Farber, Clin. Res. , 31(A) , 769A 
(1983).] 

Since the filing of U.S. Patent Application 
Serial Nos. 561,024 and 582,185, there has appeared a 
single report of the cloning and expression of what is 

25 asserted to have been human erythropoietin cDNA in 
E.coli . Briefly put, a number of cDNA clones were 
inserted into E.coli plasmids and 6-lactamase fusion pro- 
ducts were noted to be immunoreactive with a monoclonal 
antibody to an unspecified "epitope" of human erythro- 

30 poietin. See , Lee -Huang , Proc^ Na^ A£ad^ Sci^ 
81, pp. 2708-2712' (1984) . 

BRIEF SUMMARY 

35 The present invention provides, for the first 

time, novel purified and isolated polypeptide products 
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having part or all of the primary structural conformation 
(i.3., continuous sequence of amino acid residues) and 
one or more of the biological properties (e.g., immunolo- 
gical properties and' ui vivo and i_n vitro biological 
5 activity) of naturally-occurring erythropoietin, 

including allelic variants thereof. These polypeptides 
are also uniquely characterized by being the product of 
procaryotic or eucaryotic host expression (e.g., by bac- 
terial, yeast and mammalian cells in culture) of exoge- 

10 nous DNA sequences obtained by genomic or cONA cloning or 
by gene synthesis. Products of microbial expression in 
vertebrate (e.g., mammalian and avian) cells may be 
further characterized by freedom from association with 
human proteins or other contaminants which may be asso- 

15 ciated with erythropoietin in its natural mammalian 

cellular environment or in extracellular fluids such as 
plasma or urine. The products of typical yeast (e.g., 
Saccaromvces cere vis iae ) or procaryote (e.g., E . col i ) 
host cells are free of association with any mammalian 

20 proteins. Depending upon the host employed, polypeptides 
of the invention may be glycosylated with mammalian or 
other eucaryotic carbohydrates or may be non- 
glycosylated . Polypeptides of the invention may also 
include aninitial methionine amino acid residue (at 

25 position -1 ) . 

Novel glycoprotein products of the invention 
include those having a primary structural conformation 
sufficiently duplicative of that of a naturally-occurring 
(e.g., human) erythropoietin to allow . possession of one 

30 or more of the biological properties thereof and having 
an average carbohydrate composition which differs from 
that of naturally-occurring (e.g., human) erythropoietin. 

Vertebrate (e.g., COS-1 and CHO) cells provided 
by the present invention comprise the first cells ever 

35 available which can be propagated iji vitro continuously 
and which upon growth in culture are capable of producing 
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in the medium of their growth in excess of lOOU 
(preferably in excess of 500U and most preferably in 
excess of 1,000 to 5,000U) of erythropoietin per 
10^ cells in 48 hours as determined by radioimmunoassay. 
5 Also provided by the present invention are 

synthetic polypeptides wholly or partially duplicative of 
continuous sequences of erythropoietin amino acid resi- 
dues which are herein for the first time elucidated. 
These sequences, by virtue of sharing primary, secondary 

10 or tertiary structural and conformational characteristics 
with naturally-occurring erythropoietin may possess 
biological activity and/or immunological properties in 
common with the naturally-occur flng~^product such that 
they may be employed as biologically active or immunolo- 

15 gical substitutes for erythropoietin in therapeutic and 
immunological processes. Correspondingly provided are 
monoclonal and polyclonal antibodies generated by stan- 
dard means which are immunor ea^ : ive with such polypep- 
tides and, preferably, also im^ jnoreacti ve with 

20 naturally-occurring erythropoietin. 

Illustrating the present invention are cloned 
ONA sequences of monkey and human species origins and 
polypeptide sequences suitably deduced therefrom which 
represent, respectively, the primary structural confor- 

25 mation of erythropoietins of monkey and human species 
origins. 

Also provided by the present invention are novel 
biologically functional viral and circular plasmid ONA 
vectors incorporating ONA sequences of the invention and 

30 microbial Ce.g., bacterial, yeast and mammalian cell) 
host organisms stably transformed or transfected with 
such vectors. Correspondingly provided by the invention 
are novel methods for the production of useful polypep- 
tides comprising cultured growth of such transformed or 

35 transfected microbial hosts under conditions facilitative 
of large scale expression of the exogenous, vector-borne 
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ONA sequences and isolation of the desired polypeptides 
from the growth medium, cellular lysates or cellular 
membrane fractions. 

Isolation and purification of microbially 
5 expressed polypeptides provided by the invention may be 
by conventional means including, e.g., preparative chro- 
matographic separations and immunological separations 
involving monoclonal and/or polyclonal antibody prepara- 
tions. 

10 Having herein elucidated the sequence of amino 

acid residues of erythropoietin, the present invention 
provides for the total and/or partial manfucture of ONA 
sequences coding for erythropoietin and including such 
advantageous characteristics as incorporation of codons 

15 "preferred" for expression by selected non-mammalian 
hosts, provision of sites for cleavage by restriction 
endonuclease enzymes and provision of additional initial, 
terminal or intermediate DNA sequences which facilitate 
construction of readily expressed vectors. Corres- 

20 pondingly, the present invention provides for manufacture 
[and development by site specif ic mutagenesis of cDNA and 
genomic DNA) of DNA sequences coding for microbial 
expression of polypeptide analogs or derivatives of 
ery thr ODOiet in which differ from naturally-occurring 

25 forms in terms of the identity or location of one or more 
amino acid residues (i.e., deletion analogs containing 
less than all of the residues specified for EPO and/or 
substitution analogs wherein one or more residues spe- 
cified are replaced by other residues and/or addition 

30 analogs wherein one or more amino acid residues is added 
to a terminal or medial portion of the polypeptide); and 
which share some or all the properties of naturally- 
occurring forms. 

Novel ONA sequences of the invention include all 

35 sequences useful in securing expression in procaryotic or 
eucaryotic host cells of polypeptide products having at 
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least a part of the primary structural conformation and 
one or more of the biological properties of erythro- 
poietin which are comprehended by: (a) the DNA sequences 
set out in Tables V and VI herein or their complementary 
5 strands; (b) DNA sequences which hybridize (under hybri- 
dization conditions such as illustrated herein or more 
stringent conditions) to DNA sequences defined in (a) or 
fragments thereof; and Cc] DNA sequences which, but for 
the degeneracy of the genetic code, would hybridize to 

10 DNA sequences defined in (a) and (b) above. Specifically 
comprehended in part (b) are genomic DNA sequences 
encoding allelic variant forms of monkey and human 
erythropoietin and/or encoding' other mammalian species of 
erythropoietin. Specifically comprehended by part (c)-. 

15 are manufactured DNA sequences encoding EPO, EPO 

fragments and EPO analogs which DNA sequences may incor- 
porate codons facilitating translation of messenger RNA 
in non- ver tebr a te hosts. 

Comprehended by the present invention is that 

20 class of polypeptides coded for by portions of the DNA 
complement to the top strand human genomic DNA sequence 
of Table VI herein, i.e., "complementary inverted pro- 
teins" as described by Tramontano, et al., Nucleic Acids 
Research , 1^, PP- 5049-5059 C1984). 

25 Also comprehended by the invention are phar- 

maceutical compositions comprising effective amounts of 
polypeptide products of the invention together with 
suitable diluents, adjuvants and/or carriers which allow 
for provision of erythropoietin therapy, especially in 

30 the treatment of anemic disease states and most espe- 
cially such anemic states as attend chronic renal 
failure. 

Polypeptide products of the invention may be 

••labelled" by covalent association with a detectable 

125 

35 marker substance (e.g., radiolabelled with I) to pro- 
vide reagents useful in detection and quantification of 
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erythropoietin in solid tissue and fluid samples such as 
blood or urine. DNA products of the invention may also 
be labelled with detectable markers (such as radiolabels 
and non-isotopic labels such as biotin) and employed in 
5 DNA hybridization processes to locate the erythropoietin 
gene position and/or the position of any related gene 
family in the 'human, monkey and other mammalian species 
chromosomal map. They can also be used for identifying 
the erythropoietin gene disorders at the ONA level and 

10 used as gene markers for identifying neighboring genes 
and their disorders. 

As hereinafter described in detail, the present 
invention further provides significant improvements in 
methods for detection of a specific single stranded poly- 

15 nucleotide of unknown sequence in a heterogeneous cellu- 
lar or viral sample including multiple single-stranded 
polynucleotides where 

(a) a mixture of labelled single-stranded poly- 
nucleotide probes is prepared having uniformly varying 

20 sequences of bases, each of said probes being potentially 
specifically complementary to a sequence of bases which 
is putatively unique to the polynucleotide to be 
detected , 

Cb) the sample is fixed to a solid substrate, 
25 (c) the substrate having the sample fixed 

thereto is treated to diminish further binding of poly- 
nucleotides thereto except by way of hybridization to 
polynucleotides in said sample, 

Cd) the treated substrate having the sample 
30 fixed thereto is transitorily contacted with said mixture 
of labelled probes under conditions facilitative of 
hybridization only between totally complementary poly- 
nucleotides, and, 

Ce) the specific polynucleotide is detected by 
35 monitoring for the presence of a hybridization reaction 
between it and a totally complementary probe within said 
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mixture of labelled probes, as evidence'd by the presence 
of a higher density of labelled material on the substrate 
at the locus of the specific polynucleotide in comparison 
to a background density of labelled material resulting 
5 from non-specific binding of labelled probes to the 
substrate. 

The procedures are especially effective in 
situations dictating use of 6A, 128, 256, 512, 1024 or 
more mixed polynucleotide probes having a length of 17 to 

10 20 bases in ONA/DNA or RNA/RNA or DNA/RNA hybridizations. 

As described infra , the above-noted improved 
procedures have illustratively allowed for the iden- 
tification of cONA clones coding" 'f of ^ erythropoietin of 
monkey species origins within a library prepared from 

15 anemic monkey kidney cell mRNA. More specifically, a 
mixture of 128 uniformly varying 20-mer probes based on 
amino acid sequence information derived from sequencing 
fractions of human erythropoietin was employed in colony 
hybridization procedures to identify seven "positive" 

20 erythropoietin cDNA clones within a total of 200,000 

colonies. Even more remarkably, practice of the improved 
procedures of the invention have allowed for the rapid 
isolation of three positive clones from within a 
screening of 1,500,000 phage plaques constituting a human 

25 genomic library. This was accomplished through use of 
the above-noted mixture of 128 20-mer probes together 
with a second set of 128 17-mer probes based on amino 
acid analysis of a different continuous sequence of human 
erythropoietin . 

30 The above-noted illustrative procedures consti- 

tute the first known instance of the use of multiple 
mixed oligonucleotide probes in ONA/DNA hybridization 
processes directed toward isolation of mammalian genomic 
clones and the first known instance of the use of a mix- 

35 ture of more than 32 oligonucleotide probes in the isola- 
tion of cONA clones. 
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Numerous aspects and advantages of the invention 
will be apparent to those skilled in the art upon 
consideration of the following detailed description which 
provides illustrations of the practice of the invention 
5 in its presently preferred embodiments. 

DETAILED DESCRIPTION 

According to the present invention, ONA 

10 sequences encoding part or all of the polypeptide 
sequence of human and monkey species erythropoietin 
(hereafter, at times, "EPO") have been isolated and 
characterized. Further, the monkey and human origin DNA 
has been made the subject of eucaryotic and procaryotic 

15 expression providing isolatable quantities of polypep- 
tides displaying biological (e.g., immunological) proper- 
ties of naturally-occurring EPO as well as both in vivo 
and i£ vitro biological activities of EPO, 

The DNA of monkeV slsecies origins was isolated 

23 from a cDNA library constructed with mRNA derived from 
kidney tissue of a monkey in a chemically induced anemic 
state and whose serum was immunologically determined to 
include high levels of EPO compared to normal monkey 
serum. The isolationof the desired cONA clones con- 

25 taining EPO encoding DNA was accomplished through use of 
DNA/DNA colony hybridization employing a pool of 128 
mixed, radiolabelled , 20-mer oligonucleotide probes and 
involved the rapid screening of 200,000 colonies. Design 
of the oligonucleotide probes was based on amino acid 

30 sequence information provided by enzymatic fragmentation 
and sequencing a small sample of human EPO. 

The DNA of human species origins was isolated 
from a human genomic DNA library. The isolation of 
clones containing EPO-encoding DNA was accomplished 

35 through DNA/DNA plaque hybridization employing the above- 
noted pool of 128 mixed 2Q-mer oligonucleotide probes and 
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a second pool of 128 radiolabelled 17-mer probes whose 
sequences were based on amino acids sequence information 
obtained from a different enzymatic human EPO fragment. 
Positive colonies and plaques were verified by 
5 means of dideoxy sequencing of clonal DNA using a subset 
of 16 sequences within the pool of 20-mer probes and 
selected clones were subjected to nucleotide sequence 
analysis resulting in deduction of primary structural 
conformation of the EPO polypeptides encoded thereby. 

10 The deduced polypeptide sequences displayed a high degree 
of homology to each other and to a partial sequence 
generated by amino acid analysis of human EPO fragments. 

A selected positive monkey^cONA clone and a 
selected positive human genomic clone were each inserted 

15 in a "shuttle" DNA vector which was amplified in E .coli 
and employed to transfect mammalian . cells in culture. 
Cultured growth of transfected host cells resulted in 
culture medium supernatant preparations estimated to con- 
tain as much as 3000 mU of EPO per ml of culture fluid. 

20 The following examples are presented by way of 

illustration of the invention and are specifically 
directed to procedures carried out prior to iden- 
tification of EPO encoding monkey cONA clones and human 
genomic clones, to procedures resulting in such iden- 

25 tification, and to the sequencing, development of 

expression systems and immunological verification of EPO 
expression in such systems. 

More particularly, Example 1 is directed to 
amino acid sequencing of human EPO fragments and con- 

30 struction of mixtures of radiolabelled probes based on 
the results of this sequencing. Example 2 is generally 
directed to procedures involved in the identification of 
positive monkey cONA clones and thus provides information 
concerning animal treatment and preliminary radioim- 

35 munoassay (RIA) analysis of animal sera. Example 3 is 
directed to the preparation of the cDNA library, colony 
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hybridization screening and verification of positive 
clones, DNA sequencing of a positive cDNA clone and the 
generation of monkey £?0 polypeptide primary structural 
conformation (amino acid sequence] information. Example 
5 4 is directed to procedures involved in the iden- 
tification of positive human genomic clones and thus pro- 
vides information concerning the source of the genomic 
library, plaque hybridization procedures and verification 
of positive clones. Example 5 is directed to DNA 
10 sequencing of a positive genomic clone and the generation 
of human EPO polypeptide amino acid sequence information 
including a comparison thereof to the monkey EPO sequence 
information. Example 6 is directed to procedures for 
construction of a vector incorporating EPO-encoding DNA 
15 derived from a positive monkey cONA clone, the use of the 
vector for transfection of COS-1 cells and cultured 
growth of the transfected cells. Example 7 is directed 
to procedures for construction of a vector incorporating 
EPO-encoding DNA derived"fr6m a positive human genomic 
20 clone, the use of the vector for transfection of COS-1 
cells and the cultured growth of the transfected cells. 
Example 8 is directed to immunoassay procedures performed 
on media supernatants obtained from the cultured growth 
of transfected cells_ according to Example 6 and- 7. 
25 Example 9 is directed to in vitro and in vivo biological 
activity of microbially expressed EPO of Examples 6 and 
7. 

Example 10 is directed to a development of mam- 
malian host expression systems for monkey species EPO 

30 cDNA and human species genomic DNA involving Chinese 

hamster ovary ("CHO") cells and to the immunological and 
biological activities of products of these expression 
systems as well as characterization of such products. 
Example 11 is directed to the preparation of manufactured 

35 genes encoding human species EPO and EPO analogs, which 
genes include a number of preference codons for 



wo 83/02610 



- 23 - 



;T/US84/02021 



expression in E , col i and yeast host cells, and to 
expression systems based thereon. Example 12 relates to 
the immunological and biological activity profiles of 
expression products of the systems of Example 11. 

5 

EXAMPLE 1 

A. Human EPQ Fragment Amino Acid Sequencing 

Human EPO was isolated from urine and subjected 

10 to tryptic digestion resulting in the development and 

isolation of 17 discrete fragments in quantities approxi- 
mating 100-150 picomoles. 

Fragments were arbitrarily^ assigned numbers and 
were analyzed for amino acid sequence by microsequence 

15 analysis using a gas phase sequencer (Applied Biosystems] 
to provide the sequence information set out in Table I, 
below, wherein single letter codes are employed and "X" 
designates a residue which was not unambiguously deter- 
mined . 
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TABLE I 

Fragment No. Sequence analysis Result 

5 T4a A-P-P-R 

T4b G-K-L-K 

T9 • A-L-G-A-Q-K 

T13 V-L-E-R 

T16 A-V-S-G-L-R 

10 T13 L-F-R 

T21 K-L-F-R 

T25 Y-L-L-E-A-K 

T26a L-I-C-D-S-R 

T26b L-Y-T-G-E-A-C-R 

15 T27 T-I-T-A-D-T-F-R 

T28 E-A-I-S-P-P-O-A-A-M-A-A-P-L-R 

T30 E-A-E-X-I-T-T-G-X-A-E-H-X-S-L- 

N-E-X-I-T-V-P 

T31 - V_"Y-S-N-F-L-R 

2C T33 S-L-T-T-L-L-R 

T35 V-N-F-Y-A-W-K 

T38 G-Q-A-L-L-V-X-S-S-Q-P-W- 

E-P-L-Q-L-H-V-D-K 
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B, Design and Construction of 

Oligonucleotide Probe Mixtures 

The amino acid sequences set out in Table I were 
reviewed in the context of the degeneracy of the .genetic 
5 code for the purpose of ascertaining whether mixed probe 
procedures could be applied to DNA/DNA hybridization pro- 
cedures on cDNA and/or genomic DNA libraries. This ana- 
lysis revealed that within Fragment No. T35 there existed 
a series of 7 amino acid residues 

10 ( Val-Asn-Phe-Tyr-Ala-Trp-Lys ) which could be uniquely 
characterized as encoded for by one of 128 possible DNA 
sequences spanning 20 base pairs. A first set of 128 
20-mer oligonucleotides was therefore synthesized by 
standard phosphoamidi te methods (See, e.g., Beaucage, et 

15 al., Tetrahedron Letters , 22, pp. 1859-1862 (1981) on a 
solid support according to the sequence set out in Table 
II, below. 



20 



TABLE II 



Residue - Val - Asn Phe Tyr Ala Trp Lys 

3' CAA TTG AAG ATG CGA ACC TT - 5* 

T A A A T 

G G 

25 C C 

Further analysis revealed that within fragment 
No. T38 there existed a series of 6 amino acid residues 
(Gln-Pro-Trp-Glu-Pro-Leu) on the basis of which there 
could be prepared a pool of 128 mixed olignucleotide 
30 17-mer probes as set out in Table III, below. 

TABLE III 



Residue - Gin Pro Trp Glu Pro Leu 

35 y GTT GGA ACC CTT GGA GA - 5' 

C T C T A 

G G 
C C 
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Oligonucleotide probes were labelled at the 5' 
end with gamma - •'^P-ATP, 7500-8000 Ci/mmole (ICN) using 
polynucleotide kinase (NEN). 

5 EXAMPLE 2 

A. Monkey Treatment Procedures and RIA Analysis 

Female Cynomolgus monkeys Macaca f asc icular ias 
(2.5-3 kg, 1.5-2 years old) were treated subcutaneously 

10 with a pH 7.0 solution of pheny Ihydr azine hydrochloride 
at a dosage level of 12.5 mg/kg on days 1, 3 and 5. The 
hematocrit was monitored prior to each injection. On day 
7, or whenever the hematocrit level fell below 25% of the 
initial level, serum and kidneys were harvested after 

15 administration of 25 mg/kg doses of ketamine hydroch- 
loride. Harvested materials were immediately frozen in 
liquid nitrogen and stored at -70'C. 

B. RIA for EPO 

20 Radioimmunoassay procedures applied for quan- 

titative detection of EPO in samples were conducted 
according to the following procedures: 

An erythropoietin standard or unknown sample was 
incubated together w_ith antiserum for two hours at 37*C. 

25 After the two hour incubation, the sample tubes were 
cooled on ice, ^^^I-labelled erythropoietin was added, 
and the tubes were incubated at O-C for at least 15 more 
hours. Each assay tube contained 500 yl of incubation 
mixture consisting of 50 yl of diluted immune sera, 

30 10,000 cpm of ^^^I-er ythropoiet in , 5 ul trasylol and 

0-250 yl of either EPO standard or unknown sample, with 
PBS containing O.IX BSA making up the remaining volume. 
The antiserum used was the second test bleed of a rabbit 



35 



wo 85/02610 



^/US84/02021 



- 32 - 

immunized with a IX pure preparation of human urinary 

erythropoietin. The final antiserum dilution on the 

125 

assay was adjusted so that the antibody-bound I-EPO 
did not exceed 10-20% of the input total counts. In 
5 general, this corresponded to a final antiserum dilution 
of from 1:50,000 to ■ 1 : 100 , 000 . 

The antibody-bound ^^^I-erythropoietin was pre- 
cipitated by the addition of 150 yl Staph A. After a 40 
min. incubation, the samples were centrifuged and the 

10 pellets were washed two times with 0.75 ml 10 mM Tris-HCl 
pH 8.2 containing 0.15M NaCl, 2mM EDTA, and 0.05% Triton 
X-100. The washed pellets were counted in a gamma 
counter to determine the percent""of I-erythropoiet in 
bound. Counts bound by pre-immune sera were subtracted 

15 from all final values to correct for nonspecific precipi- 
tation. The erythropoietin content of the unknown 
samples was determined by comparison to the standard 
curve . 

The above procedure was applied to monkey serum 
20 obtained in Part A, above, as well as to the untreated 
monkey serum. Normal serum levels were assayed to con- 
tain approximately 36 mU/ml while treated monkey serum 
contained from 1000 to 1700 mU/ml. 

25 . EXAMPLE 3 

A. Monkey cDNA Library Construction 

Messenger RNA was isolated from normal and ane- 
mic monkey kidneys by the guanidinium thiocyanate proce- 

30 dure of Chirgwin, et al., Biochemistry , 18, p. 5294 
(1979 ) and poly (A)"*" mRNA was purified by two runs of 
oligoC dT )-cellulose column chromatography as described at 
pp. 197-198 in Maniatis, et al., "Molecular Cloning, A 
Laboratory Manual" (Cold Springs Harbor Laboratory, Cold 

35 Springs, Harbor, N.Y., 1982). The cDNA library was con- 
structed according to a modification of the general pro- 
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cedures of Okayama, at al., Mol . and Cell . 9iol . , 2, 
pp. 161-170 (1982). The key features of the presently 
preferred procedures were as follows: (1) pUCB was used 
as the- sole vector, cut with ?st I and then tailed with 
5 oligo dT of 60-80 bases in length; (2) Hind i digestion 
was used to remove the oligo dT tail from one end of the 
vector; (3] first strand synthesis and oligo dG tailing 
was carried out according to the published procedure; (4) 
Bam Hl digestion was employed to remove the oligo dG tail 
10 from one end of the vector; and [5) replacement of the 
RNA strand by ONA was in the presence of two linkers 
(GATCTAAAGACCGTCCCCCCCCC and ACGGTCTTTA) in a three-fold 
molar excess over the oligo dG tailed vector. 

15 8, Colony Hybridization Procedures For 

Screening Monkey cDNA Library 

Transformed E .col i were spread out at a density 
of 9000 colonies per 10 x 10 cm plate on nutrient plates 
containing 50 micrograms7ml Ampicillin. GeneScreen 

20 filters (New England Nuclear Catalog No- NEF-972) were 

pre-wet on a 3HI-CAM plate (Bacto brain heart infusion 37 
g/L, Casamino acids 2 g/L and agar 15 g/L, containing 500 
micrograms/ml Chloramphenicol) and were used to lift the 
colonies off the plate. The colonies were grown in the 

25 same medium for 12 hours or longer to amplify the plasmid 
copy numbers. The amplified colonies (colony side up) 
were treated by serially placing the filters over 2 
pieces of Whatman 3 MM paper saturated with each of the 
following solutions: 

30 (1) 50 mM glucose - 25 mM Tris-HCl (pH 8.0) - 

10 mM EDTA (pH 8.0) for five minutes; 

(2) 0.5 M NaOH for ten minutes; and 

(3) 1.0 M Tris-HCl (pH 7.5) for three minutes. 
The filters were then air dried in a vacuum over 

35 at 80'C for two hours. 

The filters were then subjected to Proteinase K 
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digestion through treatment with a solution containing 50 
micrograms/ml of the protease enzyme in Buffer K Co.lM 
Tris-HCl (pH 8.0) - 0.15M NaCl - 10 mM EDTA ( pH 8,2) 
-0.2% SDS] • Specifically, 5 ml of the solution was added 
5 to each filter and the digestion was allowed to proceed 
at 55*C for 30 minutes, after which the solution was 
removed . 

The filters were then treated with 4 ml of a 
prehybridization buffer (5 x SSPE - 0.5% SDS - 100 
10 micrograms/ml SS E.coli DNA - 5 x 8FP). The prehybridi- 
zation treatment was carried out at 55*C, generally for 4 
hours or longer, after which the prehybridization buffer 
was removed. 

The hybridization process was carried out in the 
15 following manner. To each filter was added 3 ml of 
hybridization buffer (5 x SSPE - 0.5% SOS - 100 
micrograms/ml yeast tRNA) containing 0.025 picomoles of 
each of the 128 probe sequences of Table II Cthe total 
mixture being designated the EPV mixture) and the filters 
20 were maintained at 48-C for 20 hours. This temperature 
was 2'C less than the lowest of the calculated disso- 
ciation temperatures (Td) determined for any of the pro- 
bes. 

Following hybridization, the filters were washed 
25 three times for ten minutes on a shaker with 6 x SSC 
'0.1% SDS at room temperature and washed two to three 
times with 6 x SSC - 1% SDS at the hybridization tem- 
perature (48*C). 

Autoradiography of the filters revealed seven 
30 positive clones among the 200,000 colonies screened. 

Initial sequence analysis of one of the putative 
monkey cONA clones (designated clone 83) was performed 
for verification purposes by a modification of the proce- 
dure of Wallace, et al.. Gene , 16, pp. 21-26 (1981). 
35 Briefly, plasmid DNA from monkey cDNA clone 83 was 
linearized by digestion with Eco RI and denatured by 
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heating in a boiling water bath. The nucleotide sequence 
was determined by the dideoxy method of Sanger, et al., 
P.N. A. 5. (U.S.A. ) , 76, pp. 5463-5667 ( 1977). A subset of 
the EPV mixture of probes consisting of 16 sequences was 
5 used as a primer for the sequencing reactions. 

C. Monkey EPO cONA Sequencing 

Nucleotide sequence analysis of clone 83 was 
carried out by the procedures of Messing, Methods in 

10 EnzymoloQv, 101 , pp. 20-78 (1983]. Set out in Table IV 
is a preliminary restriction map analysis of the approxi- 
mately 1600 base pair EcoR I/ Hind lll cloned fragment of 
clone 83. Approximate locations of restriction endo- 
nuclease enzyme recognition sites are provided in terms 

15 of number of bases 3* to the EcoRI site at the 5' end of 
the fragment. Nucleotide sequencing was carried out by 
sequencing individual restriction fragments with the 
intent of matching overlaoping fragments. For example, 
an overlap of sequence information provided by analysis 

2G of nucleotides in a restriction fragment designated C113 
( Sau 3A at -111/SmaI at --324) and the reverse order 
sequencing of a fragment designated C73 ( Alu l at 
^624/ 3st£ II at -203). 

25 
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TABLE IV 



Restriction Enzyme 
Recognition Site 



Approximate Location(s) 



5 


EcoRI 


1 




Sau3A 


111 




Sma I 


180 




BstEII 


203 




Sma I 


324 


10 


Kpnl 


371 




Rsal 


372 




Alul 


424 




PstI 


426 




Alul 


430 


15 


Hpal 


466 




Alul 


546 




PstI 


601 




PvuII 


604 




Alul 


605 


20 


Alul 


782 




Alul 


788 




Rsal 


792 




PstI 


807 




Alul 


841 


25 


Alul 


927 




NCOl 


946 




Sau3A 


1014 




Alul 


1072 




Alul 


1115 


30 


Alul 


1223 




PstI 


1301 




Rsal 


1343 




Alul 


1384 




Hindi! I 


1449 


35 


Alul 


1450 




Hindlll 


1585 
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Sequencing of accrcx imately 13^2 base pairs 
(within the region spanning the 5au 3A site 3' tc the 
EccRI site and the Hind lll site) and analysis of ail 
possible reading frames has allowed for the development 
5 of DNA and amino acid sequence information set out in 
Table V. In the Table, the putative initial amino acid 
residue of the amino terminal of mature EPO (as verified 
by correlation to the previously mentioned sequence ana- 
lysis of twenty amino terminal residues) is designated by 

10 the numeral -^1. The presence of a methionine-specif y ing 
ATG codon (designated -27) "upstream" of the initial 
amino terminal alanine residue as the first residue 
designated for the amino acid sequence of the mature pro- 
tein is indicative of the likelihood that EPO is ini- 

15 tially expressed in the cytoplasm in a precursor form 
including a 27 amino acid "leader" region which is 
excised prior to entry of mature EPO into circulation. 
Potential glycosy lat ion sites within the polypeptide are 
designated by asterisks.'"* The estimated molecular weight 

20 of the translated region was determine to be 21,117 

daltons .and the M.W. of the 165 residues of the polypep- 
tide constituting mature monkey EPO was determined to be 
18,236 daltons, 

25 
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The polypeptide sequence of Table V may readily 
be subjected to analysis for the presence of highly 
hydrcphilic regions and/or secondary conformational 
characteristics indicative of potentially highly immuno- 
5 genie regions by, e.g., the methods of Hopp, et al., 
P,N.A,5. [U.S.A.) , 78, pp. 3824-3828 (1981) and Kyte et 
al., J.Mol.Biol. , 157 , pp. 105-132 (1982) and/or Chou, et 
al., BiochefjK, 13, pp. 222-245 (1974) and Advances in 
Enzvmology , 47 , pp. 45-47 (1978). Computer-assisted ana-- 
10 lysis according to the Hopp, et al. method is available 
by means of a program designated PEP Reference Section 
6.7 made available by Intelligenet ics , Inc., 124 
University Avenue, Palo Alto, California. 

15 EXAMPLE 4 

A. Human Genomic Library 

A Ch4A phage-borne human fetal liver genomic 
library prepared according to the procedures of Lawn, et 
20 al., Cell , 13 , pp. 533-543 (1979) was obtained and main- 
tained for use in a plaque hybridization assay. 

B. Plaque Hybridization Procedures For 
Screening Human Genomic Library 

25 Phage particles were lysed and the DNAs were 

fixed on filters (50,000 plaques per filter) according to 
the procedures of Woo, Methods In Enzvmology , 68 , pp. 
389-395 (1979) except for the use of GeneScreen Plus 
filters (New England Nuclear Catalog No. NEF-976] and 

30 NZYAM plates (NaCl, 5g; MgCl2-6H20, 2 g; NZ-Amine A, lOg; 
yeast extract, 5g; casamino acids, 2 g; maltose; 2g; and 
agar, 15g per liter). 

The air-dried filters were baked at 80-C for 1 
hour and then digested with Proteinase K as described in 

35 Example 3, Part B. Prehybridization was carried out with 
a IM NaCl - 158 SDS buffer for 55'C for 4 hours or more, 
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after which the buffer was removed. Hybridization and 
post-hybridization washings were carried out as described 
in Example 3, Part 3, Both the mixture of 128 20-mer 
probes designated EPV and the mixture of 128 17-mer pro- 
5 bes of Table III (designated the EPQ mixture) were 

employed. Hybridization was carried out at 48*C using 
the EPV probe mixture. EPQ probe mixture hybridization 
was carried out at 46'C — 4 degrees below the lowest 
calculated Td for members of the mixture. Removal of the 

10 hybridized probe for rehybr idizat ion was accomplished by 
boiling with 1 x SSC - 0.1% SOS for two minutes. 
Autoradiography of the filters revealed three positive 
clones (reactive with both probe^nrl^ lures ) among the 
1,500,000 phage plaques screened. Verification of the 

15 positive clones as being EPO-encoding was obtained 

through DNA sequencing and electron micrographic visuali- 
zation of heteroduplex formation with the monkey cONA of 
Example 3. This procedure alsc gave evidence of multiple 
introns in the genomic DNA sec-^ence. 

20 

EXAMPLE 5 

Nucleotide sequence analysis of one of the posi- 
tive clones (designated XhEl) was carried out and results 
25 obtained to date are set out in Table VI. 



30 



35 
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In Table VI, the initial continuous DNA 
sequence designates a top strand of 620 bases in what is 
apoarently an untranslated sequence immediately preceding 
a translated portion of the human EPO gene. More speci- 
5 fically, the sequence appears to comprise the 5* end of 
the gene which leads up to a translated DNA region coding 
for the first four amino acids (-27 through -24) of a 
leader sequence ( presequence" ) . Four base pairs in the 
sequence prior to that encoding the beginning of the 

10 leader have not yet been unambiguously determined and are 
therefore designated by an "X". There then follows an 
intron of about 639 base pairs (439 base pairs of which 
have been sequenced and the remaining 200 base pairs of 
which are designated "I.S.") and immediately preceding a 

15 codon for glutamine which has been designated as residue 
-23 of the translated polypeptide. The exon sequence 
immediately following is seen to code for amino acid 
residues through an alanine re:idue (designated as the +1 
residue of the amino acid sequ^-^ce of mature human EPO) 

20 to the codon specifying threonine at position +26, 

whereupon there follows a second intron consisting of 256 
bases as specifically designated. Following this intron 
is an exon sequence for amino acid residues 27 through 55 
and thereafter a third intron comprising 612 base pairs 

25 commences. The subsequent exon codes for residues 56 
through 115 of human EPO and there then commences a 
fourth intron of 134 bases as specified. Following the 
fourth intron is an exon coding for residue Nos. 116 
through 166 and a "stop*' codon (TGA). Finally, Table VI 

30 identifies a sequence of 568 base pairs in what appears 
to be an untranslated 3' region of the human EPO gene, 
two base pairs of which ("X") have not yet been unam- 
biguously sequenced. 

Table VI thus serves to identify the primary 

35 structural conformation (amino acid sequence) of mature 
human EPO as including 166 specified amino acid residues 
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(estimated M,W. = 13,399). Also revealed in the Table is 
the DNA sequence coding for a 27 residue leader sequence 
along with 5* and 3' ONA sequences which may be signifi- 
cant to promoter/operator functions of the human gene 
5 operon. Sites for potential glycosylation of the mature 
human EPO polypeptide are designated in the Table by 
'* asterisks. It is worthy of note that the specific amino 

^ acid sequence of Table VI likely constitutes that of a 

" naturally occurring allelic form of human erythropoietin. 

10 Support for this position is found in the results of con- 
tinued efforts at sequencing of urinary isolates of human 
erythropoietin which provided the finding that a signifi- 
cant number of erythropoietin molecules therin have a 
methionine at residue 126 as opposed to a serine as shown 

15 in the Table. 

Table VII, below, illustrates the extent of 
polypeptide sequence homology between human and monkey 
EPO. In the upper continuous line of the Table, single 
letter designations are employed to represent the deduced 

20 translated polypeptide sequences of human EPO commencing 
with residue -27 and the lower continuous line shows the 
deduced polypeptide sequence of monkey EPO commencing at 
assigned residue number -27, Asterisks are employed to 
highlight the sequence homologies. It should be noted 

25 that the deduced human and monkey EPO sequences reveal an 
"additional" lysine (K) residue at (human) position 116. 
Cross-reference to Table VI indicates that this residue 
^ is at the margin of a putative mRNA splice junction in 

the genomic sequence. Presence of the lysine residue in 

30 the human polypeptide sequence was further verified by 

It 

sequencing of a cDNA human sequence clone prepared from 
mRNA isolated from COS-1 cells transformed with the human 
genomic ONA in Example 7, infra . 



35 
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EXAMPLE 6 

The expression system selected for initial 
attempts at microbial synthesis of isolatable quantities 
5 of EPO polypeptide material coded for by the monkey cDNA 

provided by the procedures of Example 3 was one involving 
mammalian hos€ cells (i,e., COS-1 cells, A.T.C.C. No. 
.CRL-1650). The cells were transfected with a "shuttle" 
vector capable of autonomous replication in E . coli host 

10 (by virtue of the presence of pBR322-der i ved ONA) and the 
mammalian hosts (by virtue of the presence of SV40 virus- 
derived ONA). 

More specifically, an expression vector was 
constructed according to the following procedures. The 

15 plasmid clone 83 provided in Example 3 was amplified in 
E .col i and the approximately 1.4kb monkey EPO-encoding 
DNA was isolated by Eco RI and Hind lll digestion. 
Separately isolated was an approximately 4.0 kb, 
Hind lll/ Sal l fragment from pBR322. An approximately 30 

20 bp, Eco RI/ Sal I "linker" fragment was obtained from 
M13mpl0 RF DNA (P and L Laboratories). This linker 
included, in series, an Eco RI sticky end, followed by 
Sst I , Sma I , Bam Hl and Xba l recognition sites and a Sai l 
sticky end. The above three fragments were ligated to 

25 provide an approximately 5,4 kb intermediate plasmid 

("pERS") wherein the EPO DNA was flanked on one side by a 
"bank" of useful restriction endonuclease recognition 
sites. pERS was then digested with Hindlll and Sal I to 
yield the EPO DNA and the EcoRI to Sal^I (MlJmplO) linker. 

30 The 1.4 kb fragment was ligated with an approximately 4.0 
kb BamHi/Sall of pBR322 and another Ml3mplO Hind lll/BamHI 
RF fragment linker also having approximately 30 bp. The 
M13 linker fragment was characterized by a Hind lll sticky 
end, followed by Pst I , Sai l , Xbal recognition sites and a 

35 Bam HI sticky end. The ligation product was, again, a 

useful intermediate plasmid ("pBR-EPO") including the EPO 
DNA flanked on both sides by banks of restriction site. 
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The vector chosen for expression of the EPO DNA 
in CGS-l cells ("pDSVLl") had previously been constructed 
to allow for selection and autonomous replication in 
E .coli . These characteristics are provided by the origin 
5 of replication and Ampicillin resistance gene DNA sequen- 
ces present in the region spanning nucleotides 2448 
through 4362 of paR322, This sequence was structurally 
modified by the addition of a linker providing a Hind lll 
recognition immediately adjacent nucleotide 2448 prior to 
10 incorporation into the vector. Among the selected vec- 
tor's other useful properties was the capacity to autono- 
mously replicate in COS-1 cells and the presence of a 
viral promoter sequence functional iv. mammalian cells. 
These characteristics are provided by the origin of 
15 replication DNA sequence and "late gene" viral promoter 
DNA sequence present in the 342 bp sequence spanning 
nucleotide numbers 5171 through 270 of the SV40 genome. 
A unique restriction site :) ^nas provided in the 

vector and immediately adjacer't the viral promoter 
20 sequence through use of a commercially available linker 
sequence (Collaborative Research). Also .incorporated in 
the vector was a 237 base pair sequence (derived as 
nucleotide numbers 2553 through 2770 of SV40) containing 
the "late gene" viral mRNA poly-adeny lat ion signal 
25 (commonly referred to as a transcription terminator). 

This fragment was positioned in the vector in the proper 
orientation vis-a-vis the "late gene" viral promoter via 
the unique BamHI site. Also present in the vector was 
another mammalian gene at a location not material to 
30 potential transcription of a gene inserted at the unique 
BamHI site, between the viral promoter and terminator 
sequences. [The mammalian gene comprised an approxima- 
tely 2,500 bp mouse dihydrof olate reductase (DHFR) mini- 
gene isolated from plasmid pMG-1 as in Gasser, et al., 
35 P.N,A.S. (U.S.A.), 79, pp. 6522-6526, (1982).] Again, 
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the major operative components of plasmid pOSVLl comprise 
nucleotides through 4362 of pBR322 along with 

nucleotides 5171 through 270 (342bp) and 2553 through 
2770 (237bp) of SV40 DNA. 
5 Following procedures described, e.g., in 

Maniatis, et al., supra , the EPO-encoding DNA was iso- 
lated from plasmid pBR-EPO as a Bam HI fragment and 
ligated into plasmid pDSVLl cut with BamHI. Restriction 
enzyme analysis was employed to confirm insertion of the 

10 EPO gene in the correct orientation in two of the 

resulting cloned vectors (duplicate vectors H and L). 
See Figure 2, illustrating plasmid pDSVL-MkE. Vectors 
with EPO genes in the wrong orientation were saved for 
use as negative controls in transfection experiments 

15 designed to determine EPO expression levels in hosts 
transformed with vectors having EPO DNA in the correct 
orientation. 

Vectors H, L, F, x and G were combined with^ 
carrier ONA (mouse liver and spleen DNA) were employed to 

20 * transfect duplicate 60mm plates by calcium phosphate 
microprecipitate methods. Duplicate 60 mm plates were 
also transfected with carrier ONA as a "mock" transfor- 
mation negative control. After five days all culture 
■media were tested for the presence of polypeptides 

25 possessing the immunological properties of naturally- 
occurring EPO. 

EXAMPLE 7 

30 A. Initial EPO Expression System 

Involving C05-1 Cells 

The system selected for initial attempts at 
microbial synthesis of isolatable quantities of human EPO 
polypeptide material coded for by the human genomic ONA 
35 EPO clone, also involved expression in mammalian host 
cells (i.e., COS-1 cells, A.T.C.C. No. CRL-1650). The 
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human EPO gene was first sub-cloned into a "shuttle" vec- 
tor Which is capable of autonomous replication in both 
E > col i hosts (by virtue of the presence of pBR322 derived 
DNA) and in the mammalian cell line COS-1 [by virtue of 
5 the presence of SV40 virus derived DNA). The shuttle 
vector, containing the EPO gene, was then transfected 
into COS-1 cells. EPO polypeptide material was produced 
in the transfected cells and secreted into the cell 
culture media. 

10 More specifically, an expression vector was 

constructed according to the following procedures, DNA 
isolated from lambda clone XhEl, containing the human 
genomic EPO gene, was digested "with"^ ^am HI and Hind lll 
restriction endonucleases , and a 5.6 Kb DNA fragment 

15 known to contain the entire EPO gene was isolated. This 
fragment was mixed and ligated with the bacterial plasmid 
pUCS (3ethesaa Research Laboratories, Inc.) which had 
been similarly digested, crea^:-^.g the intermediate 
plasmid "pUC8-HuE", prcviaing^ convenient source of this 

20 restriction fragment. 

The vector chosen for expression of the EPO DNA 
in COS-1 cells (pSV4SEt) had previously been constructed, 
Plasmid pSV4SEt contained DNA sequences allowing selec- 
tion and autonomous replication in E . col i . These charac- 

25 teristics are provided by the origin of replication and 
Ampicillin resistance gene DNA sequences present in the 
region spanning nucleotides 2443 through 4362 of the bac- 
terial plasmid paR322. This sequence was structurally 
modified by the addition of ,a linker providing a Hind lll 

30 recognition site immediately adjacent to nucleotide 2448. 
Plasmid pSV4SEt was also capable of autonomous replica- 
tion in COS-1 cells. This characteristic was provided by 
a 342 bp fragment containing the SV40 virus origin of 
replication (nucleotide numbers 5171 through 270). This 

35 fragment had been modified by the addition of a linker 
providing an Eco Rl recognition site adjacent to 
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nucleotide 270 and a linker providing a Sai l recognition 
site adjacent nucleotide 5171, A 1061 bp fragment of 
SV4G .was also present in this vector (nucleotide numbers 
1711 through 2772 plus a linker providing a Sai l recogni- 
5 tion site next to nucleotide number 2772). Within this 
fragment was an unique Bam HI recognition sequence. In 
summary, plasmid pSV4SEt contained unique BamHI and 
Hind lll recognition sites, allowing insertion of the 
human EPO gene, sequences allowing replication and selec- 

10 tion in £ .coli , and sequences allowing replication in 
COS-1 cells. 

In order to insert the EPO gene into p5V4SEt, 
plasmid pUC8-HuE was digested with Bam HI and Hind lll 
restriction endonucleases and the 5.6 kb EPO encoding ONA 

15 fragment isolated, pSVASEt was also digested with BamHl 
and Hind lll and the major 2513 bp fragment isolated 
(preserving all necessary functions). These fragments 
were mixed and ligated, creating the final vector 
"pSVgHuEPO" . (See, Figure 3.) This vector was propa- 

20 gated in E .coli and vector ONA isolated. Restri'ction 

enzyme analysis was employed to confirm insertion of. the 
EPO gene. 

• Plasmid pSVgHuEPO DNA was used to express human 
EPO polypeptide material in COS-1 cells. More specifi- 

25 cally, pSVgHuEPO ONA was combined with carrier ONA and 
transfected into triplicate -60 mm plates of COS-1 cells. 
As a control, carrier ONA alone was also transfected into 
COS-1 cells. Cell culture media were sampled five and 
seven days later and tested for the presence of polypep- 

30 tides possessing the immunological properties of 
naturally occurring human EPO. 

B. Second EPO Expression System 

Involving C05-1 Cells 

35 Still another system was designed to provide 

improved production of human EPO polypeptide material 
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coded Dy the human genomic DNA EPO clone in COS-1 cells 
CA.T.CC. No, CRL-1650 ) . 

In the immediately preceding system, EPO was 
expressed in COS-1 cells using its own promoter which is 
5 within the 5,6 Kb Bam Hl to Hind lll restriction fragment. 
In the following construction, the EPO gene is altered so 
that it is expressed using the SVAO late promoter. 

More specifically, the cloned 5.6 Kb SamHI to 
Hind lll genomic human EPO restriction fragment was 

10 modified by the following procedures. Plasmid pUC8-HuE, 
as described above, was cleaved with Bam Hl and with 
BstE II restriction endonucleases . B-'E II cleaves within 
the 5.6 Kb EPO gene at a position whj ch is 44 base pairs 
5* to the initiating ATG coding for tne pre-peptide and 

15 approximately 680 base pairs 3* to the Hind lll restric- 
tion site. The approximately 4900 base pair fragment was 
isolated. A synthetic linker DNA fragment, containing 
Sai l' and BstE II sticky ends an:- an internal Bam Hl 
recognition site was synthesize: and purified. The two 

20 fragments were mixed and ligatec .-^ith plasmid p5R322 
which had been cut with Sai l and Bam Hl to produce the 
intermediate plasmid pBRgHE. The genomic human EPO gene 
can be isolated therefrom as a 4900 base pair BamHl 
digestion fragment carrying the complete structural gene 

25 with a single ATG 44 base pairs 3' to BamHl site adjacent 
the amino terminal coding region. 

This fragment was isolated and inserted as a 
Bam Hl fragment into Bam Hl cleaved expression vector 
plasmid pOSVLl (described in Example 6). The resulting 

30 plasmid, pSVLcHuEPO, as illustrated in Figure 4, was used 
to express EPO polypeptide material from COS-1 cells, as 
described in Examples 6 and 7A . 



EXAMPLE 8 

35 

Culture media from growth of the six transfected 
COS-1 cultures of Example 6 were analyzed by radioim- 



wo 85/02610 



^^US84/02021 



- 57 - 

munoassay according to the procedures set forth in 
Example 2, Part B. Each sample was assayed at 250, 125, 
50, and 25 microliter aliquot levels, Supernatants from 
growth of cells mock transfected or transfected with vec- 
5 tors having incorrect EPO gene orientation were unam- 
biguously negative for EPO immunor eact iv it y . For each 
sample of the ^two supernatants derived from growth of 
COS-1 cells transfected with vectors (H and L) having the 
EPO DNA in the correct orientation, the % inhibition of 
10 -^^I-EPO binding to antibody ranged from 72 to 88%, which 
places all values at the top of the standard curve. The 
exact concentration of EPO in the culture supernatant 
could not then reliably be estimated. A quite conser- 
vative estimate of 300 mU/ml was made, however, from the 
15 value calculation of the largest aliquot size (250 
microliter). 

A representative culture fluid according to 
Example 6 and five and seven day culture fluids obtained 
according to Example 7A were tested in the RIA in order 
20 to compare activity of recombinant monkey and human EPO ■ 
materials to a naturally-occurring human EPO standard and 
the results are set out in graphic form in Figure 1. 
Briefly, the results expectedly revealed that the recom- 
binant monkey EPO significantly competed for anti-human 
25 EPO antibody although it was not able to completely inhi- 
bit binding under the test conditions. The maximum per- 
cent inhibition values for r.ecomDinant human EPO, 
however, closely approximated those of the human EPO 
standard. The parallel nature of the dose response 
30 curves suggests immunological identity of the sequences 
(epitopes) in common. Prior estimates of monkey EPO in 
culture fluids were re-evaluated at these higher dilution 
levels and were found to range from 2.91 to 3.12 U/ml. 
Estimated human EPO production levels were correspon- 
ds dingly set at 392 mU/ml for the five-day growth sample 
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and 567 mU/ml for the seven day growth samcle. Estimated 
monkey E?0 production levels in the Example 7B expression 
system were on the same order or better. 

5 EXAMPLE .9 

Culture fluids prepared according to Examples 6 
and 7 were subjected to an ijn vitro assay for EPO acti- 
vity according to the procedure of Goldwasser, et al., 

10 Endocrinology , 97, 2, pp. 315-323 (1975). Estimated 

monkey EPO values for culture fluids tested ranged from 
3-2 to 4.3 U/ml. Human EPO culture -^luids were also 
active in this in vitro assay and ~ ""^'.r ther , this activity 
could be neutralized by anti-EPO antibody. The recom- 

15 binant monkey EPO culture fluids according to Example 6 
were also subjected to an assay for In vivo biological 
activity according to the general procedures of Cotes, et 
al-, Nature , 191 , pp. 1065-10'? (1961) and Hammond, et 
al., Ann. N.y. Acad. Sci. , 149, p: . 516-527 (1968) and acti- 

20 vity levels ranged from 0.94 to 1.24 U/ml. 

EXAMPLE 10 

In the previous examples, recombinant monkey or 
25 human EPO material was produced from vectors used to 

transfect COS-1 cells. These vectors replicate in COS-1 
cells due to the presence of 5V40 T antigen within the 
cell and an SV40 origin of replication on the vectors. 
Though these vectors produce useful quantities of EPO in 
30 COS-1 cells, expression is only transient (7 to 14 days) 
due to the eventual loss of the vector. Additionally, 
only a small percentage of COS-1 became productively 
transfected with the vectors. The present example 
describes expression systems employing Chinese hamster 
35 ovary (CHO) OHFR' cells and the selectable marker, DHFR. 
[For discussion of related expression systems, see 
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U S Letters Patent No. 4,399,216 and European Patent 
Applications 117058, 117059 and 117060, all published 

August 29, 1984.] 

• CHO OHFR- cells (DuX-811) CHO Kl cells, Urlaub, 
5 et al., Proc^ Acad^ Sci^ (U-S.A.), Vol. 77, 4461 

(1980) lack the enzyme dihydrof olate reductase CDHFR) due 
to mutations in the structural genes and therefore 
require the presence of glycine, hypoxanthine , and thymi- 
dine in the culture media. Plasmids pOSVL-MKE (Example 
10 6) or pDSVL-gHuEPO (Example 7B) were transfected along 
with carrier ONA into CHO OHFR" cells growing in media 
containing hypoxanthine, thymidine-, and glycine in 60 mm 
culture plates. Plasmid pSVgHuEPO (Example 7A) was mixed 
with the plasmid pMG2 containing a mouse dihydrof olate 
15 reductase gene cloned into the bacterial plasmid vector 
PBR322 (per Gasser, et al.. su£ra . ) The plasmid mixture . 
and carrier ONA was transfected into CHO OHFR cells. 
(Cells which acquire one plasmid will generally also 
acquire a second plasmid). After three days, the cells 
20 were dispersed by tr ypsinization into several IOC mm 

culture plates in media lacking hypoxanthine and thymi- 
dine. Only those cells which have been stably trans- 
formed with the OHFR gene, and thereby the EPO gene, 
survive in this media. After 7-21 days, colonies of sur- 
25 viving cells became apparent. These transformant colo- 
nies, after dispersion by ttypsinization can be 
continuously propagated in media lacking hypoxanthine and 
thymidine, creating new cell strains (e.g., CHO 
pDSVL-MkEPO, CHO pSVgHuEPO, CHO-pDSVL-gHuEPO ) . 
30 Culture fluids from the above cell strains were 

tested in the RIA for the presence of recombinant monkey 
or human EPO. Media for strain CHO pOSVL-MkEPO contained 
EPO with immunological properties like that obtained from 
COS-1 cells transfected with plasmid pDSVL-MkEPO. A 
35 representative 65 hour culture fluid contained monkey EPO 
at 0.60 U/ml. 
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Culture fluids from CHO pSVgHuEPO and CHO , 
pDSVL-gHuEPO contained recombinant human EPO with immuno- 
logical properties like that obtained with COS-1 cells 
transfected with plasmid pSVgHuEPO or pDSVL-gHuEPO . A 
5 representative 3 day culture fluid from CHO pSVgHuEPO 
con-tained 2.99 U/ml of human EPO and a 5.5 day sample 
from CHO pDSVL-gHuEPO had 18.2 U/ml of human EPO as 
measured by the RIA. 

The quantity of EPO produced by the cell strains 

10 described above can be increased by gene amplification 
giving new cell strains of greater productivity. The 
enzyme dihydrof olate reductase (DHFR' which is the pro- 
duct coded for by the OHFR gene can"te inhibited by the 
drug methotrexate (MTX). More specif ically , cells propa- 

15 gated in media lacking hypoxanthine and thymidine are 
inhibited or killed by MTX. Under the appropriate con- 
ditions, (e*g., minimal concentrations of MTX) cells 
resistant to and able to grew i MTX can be obtained. 
These cells are found to be re^istent to MTX due to an 

20 amplification of the number of their OHFR genes, result- 
ing in increased production of OHFR enzyme. The sur- 
viving cells can, in turn, be treated with increasing 
concentrations of MTX, resulting in cell strains con- 
taining greater numbers of OHFR genes. ^'Passenger genes" 

25 (e.g., EPO) carried on the expression vector along with 
the OHFR gene or transformed with the OHFR gene are fre- 
quently found also to be increased in their gene copy 
number. 

As examples of practice of this amplification 
30 system, cell strain CHO pDSVL-MkE was subjected to 

increasing MTX concentrations (0 nM, 30 nM and 100 nM). 
Representative 65-hour culture media samples from each 
amplification step were assayed by RIA and determined to 
contain 0.60, 2.45 and 6.10 U/ml, respectively. Cell 
35 strain CHO pDSVL-gHuEPO was subjected to a series of 
increasing MTX concentrations of 30 nM, 50 nM, 100 nM, 
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200 nM, 1 uM, and 5 uM MTX, A representative 3-day 
culture media sample from the 100 nM MTX step contained 
human EPO at 3089 i 129 u/ml as judged by RIA. 
Representative hour cultural medium samples from the 
5 100 nM and 1 yM MTX steps contained, respectively, human 
EPO at A66 and 1352 U/ml as judged by RIA (average of 
triplicate assays). In these procedures, 1 x 10^ cells 
were plated in 5 ml of media in 60 mm culture dishes. 
Twenty-four hours later the media were removed and 
10 replaced with 5 ml of serum-free media (high glucose OMEM 
supplemented with 0.1 mM non-essential amino acids and 
L-glutamine ) . EPO was allowed to accumulate for 48 hours 
in the serum-free media. The media was collected for RIA 
assay and the cells were trypsinized and counted. The 
15 average RIA values of 467 U/ml and 1352 U/ml for cells 
grown at 100 nM and 1 uM MTX, respectively, provided 
actual yields of 2335 U/plate and 6750 U/plate. The 
average cell numbers per plate were 1.94 x 10^ and 
3.12 X 10^ cells, respectively. The effective production 
20 rates for these culture conditions were thus 1264 and 
2167 U/10^ cells/48 hours. 

The cells in the cultures described immediately 
above are a genetically heterogeneous population. 
Standard screening procedures are being employed in an 
25 attempt to isolate genetically hemogeneous clones with 
the highest production capacity. See, Section A, Part 2, 
of "Points to Consider in the Characterization of Cell 
Lines Used to Produce Biologies", June 1, 1984, Office of 
Biologies Research Review, Center for Drugs and 
30 Biologies, U.S. Food and Drug Administration. 

The productivity of the EPO producing CHO cell 
lines described above can be improved by appropriate cell 
culture techniques. The propagation of mammalian cells 
in culture generally requires the presence of serum in 
35 the growth media. A method for production of erythro- 
poietin from CHO cells in media that does not contain 
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serum grsatly facilitates the purification of erythro- 
poietin from the culture medium. The method described 
below is capable of economically producing erythropoietin 
in serum-free media in large quantities sufficient for 
5 production. 

Strain CHO pOSVL-gHuEPO cells, grown in standard 
cell culture conditions, are used to seed spinner cell 
culture flasks. The cells are propagated as a suspension 
cell line in the spinner cell culture flask in media con- 

10 sisting of a 50-50 mixture of high glucose DMEM and Ham's 
F12 supplemented with 3% fetal calf serum, L-gluta- 
mine, Penicillin and Streptomycin, 0.05 mM non-essential 
amino acids and the appropriate concentration of metho- 
trexate. Suspension cell culture allows the EPO-produc- 

15 ing CHO cells to be expanded easily to large volumes. 
CHO cells, grown in suspension, are used to seed roller 
bottles at an initial seeding density of 1.5 x lo"^ viable 
cells per 850 cm^ roller bottl:. in 200 ml of media. The 
cells are allowed to grow to c:^fluency as an adherent 

20 cell line over a three-day period. The media used for 
this phase of the growth is the same as used for growth 
in suspension. At the end of the three-day growth 
period, the serum containing media is removed and 
replaced with 100 ml of serum-free media; 50-50 mixture 

25 of high glucose DMEM and Ham's F12 supplemented with 0.05 
mM non-essential amino acids and L-glutamine. The 
roller bottles are returned to the roller bottle incuba- 
tor for a period of 1-3 hours and the media again is 
removed and replaced with 100 ml of fresh serum-free 

30 media. The 1-3 hour incubation of the serum-free media 
reduces the concentration of contaminating serum pro- 
teins. The roller bottles are returned to the incubator 
for seven days during which erythropoietin accumulates in 
the serum-free culture media. At the end of the seven- 

35 day production phase, the conditioned media is removed 
and replaced with fresh serum-free medium for a second 
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production cycle. As an example of the practice of this 
production system, a representative seven-day, serum-free 
media sample contained human erythropoietin at 3892+409 
U/ml as judged by the RIA. Based on an estimated cell 

5 density of 0.9 to 1.8 x 10^ cells/cm^, each 850 

n 8 
cm"^ roller bottle contained from 0.75 to 1.5 x 10 cells 

and thus the rate of production of EPO in the 7-day, 100 

ml culture was 750 to 1470 U/10^ cells/48 hours. 

Culture fluids from cell strain CHO pDSVL-MkEPO 

10 carried in 10 nM MTX were subjected to RIA in vitro and 
in vivo EPO activity assays. The conditioned media 
sample contained 41.2 ± 1.4 U/ml of MkEPO as measured by 
the RIA, 41.2 ± 0.064 U/ml as measured by the ^n vitro 
biological activity assay and 42.5 + 5 U/ml as measured 

15 by the i£ vivo biological activity assay. Amino acid 

sequencing of polypeptide products revealed the presence 
of EPO products, a principle species having 3 residues of 
the "leader" sequence adjacent the putative amino ter- 
minal alanine. Whether this is the result of incorrect 

20 membrane processing of the polypeptide in CHO cells or 
reflects a difference in structure of the amino terminus 
of monkey EPO vis-a-vis human EPO, is presently unknown. 

Culture fluids from cell strain CHO pOSVL-gHuEPO 
were subjected to the three assays. A 5.5 day sample 

25 contained recombinant human EPO in the media at a level 
of 18.2 U/ml by RIA assay, 1-5.8 ± 4.6 U/ml by ui vitro 
assay and 16.8 ± 3.0 U/ml by in vivo assay. 

Culture fluid from CHO pOSVL-gHuEPO cells pre- 
pared amplified by stepwise 100 nM MTX were subjected to 

30 the three assays. A 3.0 day sample contained recombinant 
human EPO at a level of 3089 ± 129 U/ml by RIA, 2589 ± 
71.5 U/ml by in vitro assay, and 2040 t 160 U/ml by in 
vivo assay. Amino acid sequencing of this product 
reveals an amino terminal corresponding to that 

35 designated in Table VI. 

Cell conditioned media from CHO cells trans- 
fected with plasmid pDSVL-MkE in 10 nM MTX were pooled. 
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and the MTX dialyzed out over several days, resulting in 
media with an EPO activity of 221 + 5.1 U/ail (EPO-CCM). 
To determine the in vivo effect of ^the EPO-CCM uoon hema- 
tocrit levels in normal Balb/C mice, the following 
5 experiment was conducted. Cell conditioned media from 
untransfected CHO cells (CCM) and EPO-CCM were adjusted 
with PBS. CCM was used for the control group (3 mice) 
and two dose levels of EPO-CCM -- 4 units per injection 
and 44 units per injection -- were employed for the 

10 experimental groups (2 mice/group). Over the course of 5 
weeks, the seven mice were injected intraper itoneally , 3 
times per week. After the eighth in;ection, average 
hematocrit values for the control grc'..p were determined 
to be 50 .456; for the 4U group, 55.1*; and, for the 44U 

15 group, 67.9%. 

Mammalian cell expression products may be 
readily recovered in substantially purified form from 
culture media using HPLC CC^) : oloying an ethanol gra- 
dient, preferably at pH7. 

20 A preliminary attempt was made to characterize 

recombinant glycoprotein products from conditioned medium 
COS-1 and CHO cell expression of the human EPO gene in 
comparison to human urinary EPO isolates using both 
Western blot analysis and SDS-PAGE. These studies indi- 

25 cated that the CHO-produced EPO material had a somewhat 
higher molecular weight than the COS-1 expression product 
which, in turn, was slightly larger than the pooled 
source human urinary extract. All products were somewhat 
heterogeneous. Neuraminidase enzyme treatment to remove 

30 sialic acid resulted in COS-1 and CHO recombinent pro- 
ducts of approximately equal molecular weight which were 
both nonetheless larger than the resulting asialo human 
urinary extract. Endoglycosidase F enzyme (EC 3.2.1) 
treatment of the recombinant CHO product and the urinary 

35 extract product (to totally remove carbohydrate from 
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both) resulted in substantially homogeneous products 
having essentially identical molecular weight charac- 
teristics. 

Purified human urinary EPO and a recombinant, 
5 CHO cell-produced, EPO according to the invention were 
subjected to carbohydrate analysis according to the pro- 
cedure of Ledeen, et al. Methods in Enzymoloqy , 
83(Part D) , 139-191 (1982) as modified through use of the 
hydrolysis procedures of Nesser, et al., Anal ^Biochem . , 

10 142 , 58-67 (198A). Experimentally determined car- 
bohydrate constitution values (expressed as molar ratios 
of carbohydrate in the product) for the urinary isolate 
were as follows: Hexoses, 1.73; N-acetylglucosamine , 1; 
N-acetylneuraminic acid, 0.93; Fucose, 0; and N-acetyl- 

15 galactosamine , 0. Corresponding values for the recom- 
binant product (derived from CHO pOSVL-gHuEPO 3-day 
culture media at 100 nM MTX) were as follows: Hexoses, 
15.09; N-ace't y Iglucosamine , 1; N-acetylneuraminic acid, 
0.998; Fucose, 0; and N-ace tylgalactosamine , 0. These 

20 findings are consistent with the Western blot and 
SOS-PAGE analysis described above. 

Glycoprotein products provided by the present 
invention are thus comprehensive of products having a 
primary structural conformation sufficiently duplicative 

25 of that of a naturally-occurring erythropoietin to allow 
possession of one or more of the biological properties 
thereof and having an average carbohydrate composition 
which differs from that of naturally-occurring erythro- 
poietin . 

30 EXAMPLE 11 

The present example relates to the total manu- 
facture by assembly of nucleotide bases of two structural 
genes encoding the human species EPO sequence of Table VI 
35 and incorporating, respectively "preferred" codons for 
expression in E .col i and yeast C S.cerevisiae ) cells. 
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A^.so descrit:ed is the construction of genes encoding ana- 
llgs Of hu.an EPO. Briefly stated, the protocol employed 
was Generally as set out in the previously noted disclo- 
sure'of Alton, et al . (WO 83/04053). The genes were 
5 designed for initial assembly of component oligonucleo i- 
des into multiple duplexes which, in turn, were assembled 
into three discrete sections. These sections were 
designed for ready amplification and, upon removal rom 
the amplification system, could be assembled sequentially 
10 or through a multiple fragment ligation in a suitable 

expression vector. 

Tables VIII through XIV below illustrate the 
design and assembly of a manufactured gene encoding a 
human EPO translation product lacki:- 3 any leader or pre- 
15 seauence but including an initial methionine residue at 
oosition -1. Moreoever, the gene incorporated in 
substantial part E^ preference codons and the 
construction was therefore referred to as the ECEPO 



gene 



20 



25 



30 
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TABLE VIII 
ECEPO SECTION 1 OLIGONUCLEOTIDES 

1 . AATTCTAGAAACCATGAGGGTAATAAAATA 

2. CCATTATTTTATTACCCTCATGGTTTCTAG 
5 3. ATGGCTCCGCCGCGTCTGATCTGCGAC 

4. CTCGAGTCGCAGATCAGACGCGGCGGAG 

5. TCGAGAGTTCTGGAACGTTACCTGCTG 

6. CTTCCAGCAGGTAACGTTCCAGAACT 

7. GAAGCTAAAGAAGCTGAAAACATC 
10 8. GTGGTGATGTTTTCAGCTTCTTTAG 

9. ACCACTGGTTGTGCTGAACACTGTTC 

10. CAAAGAACAGTGTTCAGCACAACCA 

11. TTTGAACGAAAACATTACGGTACCG 

12. GATCCGGTACCGTAATGTTTTCGTT 

15 

TABLE IX 



ECEPO SECTION 1 

Xba l 

EcoRI 1 , 3 

AATTCTAG AAACCATGAG GGTAATAAAA TA |ATGG CTCC GCCGCGTCTG 
GATC TTTGGTACTC CCATTATTTT ATTACaOAGG CGGCGCAGAC 
20 2 i 



ATCTGCGAc It CGAGA GTTCT GGAACGTTAC CTGCTc bAAG CTAAAGAAGC 
TAGACGCTGA GCTCtTCAAGA CCTTGCAATG GACGACCTfq GATTTCTTCG 



TGAAAACATC BCCACTGGTT GTGCTGAACA CTGTTC(TTTG AACGAAAACA 
ACTTTTGTAG TGGTGpCCAA CACGACTTGT GACAAGAAAC] TTGCTTTTGT 
25 8 10 

Kpn l Bam Hl 
TTACGGTACC G 
AATGCCATGG CCTAG 
12 
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TA8LE X 

ECEPO SECTION 2 OLIGONUCLEOTIDES 

1. . AATTCGGTACCAGACACCAAGGT 

2. GTTAACCTTGGTGTCTGGTACCG 
5 3. TAACTTCTACGCTTGGAAACGTAT 

4. • TTCCATACGTTTCCAAGCGTAGAA 

5. GGAAGTTGGTCAACAAGCAGTTGAAGT 

6. CCAAACTTCAACTGCTTGTTGACCAAC 

7. TTGGCAGGGTCTGGCACTGCTGAGCG 
10 8. GCCTCGCTCAGCAGTGCCAGACCCT3" 

9. AGGCTGTACTGCGTGGCCAGGCA 

10. GCAGTGCCTGGCCACGCAGTACA 

11. CTGCTGGTAAACTCCTCTCAGCCGT 

12. TTCCCACGGCTGAGAGGAGTI., ACCA 
15 13. GGGAACCGCTGCAGCTGCATGTTGAC 

lA. GCTTTGTCAACATGCAGCTGCAGCGG 

15. AA'AGCAGTATCTGGCCTGAGATCTG 

16. GATCCAGATCTCAGGCCAGATACT 



20 



25 
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TABLE XII 

ECEPO SECTION 3 

1. GATCCAGATCTCTGACTACTCTGC 

5 2. ACGCAGCAGAGTAGTCAGAGATCTG 

3. TGCGTGCTCTGGGTGCACAGAAAGAGG 

4. GATAGCCTCTTTCTGTGCACCCAGAGC 

5. CTATCTCTCCGCCGGATGCTGCATCT 

6. CAGCAGATGCAGCATCCGGCGGAGA 
10 7. GCTGCACCGCTGCGTACCATCACTG ~ 

8. ATCAGCAGTGATGGTACGCAGCGGTG 

9. CTGATACCTTCCGCAAACTGTTTCG 

10. ATACACGAAACAGTTTGCGGAAGGT 

11. TGTATACTCTAACTTCCTGCaiGGTA 
15 12. CAGTTTACCACGCAGGAAGTTAGAGT 

13. AACTGAAACTGTATACTGGCGAAGC 

14. GGCATGCTTCGCCAGTATACAGTTT 

15. ATGCCGTACTGGTGACCGCTAATAG 

16. TCGACTATTAGCGGTCACCAGTAC 

20 



25 
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BamHI Bqlll 
GA TCCAGATCTCTG 
GTCTAGAGAC 



TABLE Xril 



ECEPO SECTION 3 



ACTACTCTGC 
TGATGAGACG 



GCGT GCTCT GGGTGCACAG AAAGAGG^TA_ 
CnCA|:GAGA CCCACGTGTC TTTCTCCGAT 

4 



TCTCTCCGCC 
A^GAGGCGG 



10 



7 9 

GGATGCTGCA TCThCTGCAC CGCTGCGTAC CATCACT GCT GAT ACCTTCC 

CCTACGACGT AGACGACfcTG GCGACGCATG GTAGTGACGA CTflTGGAAGG 
6^8 



11 



13 



GCAAACTGTT TCG tTGTATA C TCTAACTTCC TGCGTGGTA h ACTGA AACTG 
CGTTTGACAA AGCACATA[TG AGATTGAAGG ACGCACCATT TGACITTTGAC 
10 12 



15 



TATACTGGCG AAGC 
ATATGACCGC TTCG 
14 



GCC G 
ACG^C 



15 

TACTGGTGAC 
ATGACCACTG 
16 



Sai l 

CGCTAATAG 
GCGATTATC AGCT 
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TASLE XIV 



ECEPO GENE 



Xbal 
CTAG 



AAACCATGAG 
TTTGGTACTC 



GGTAATAAAA 
CCATTATTTT 



-1 1 
MetAla 
TAATGGCTCC 
ATTACCGAGG 



GCCGCGTCTG 
CGGCGCAGAC 



ATCTGCGACT CGAGAGTTCT GGAACGTTAC CTGCTGGAAG CTAAAGAAGC 
TAGACGCTGA GCTCTCAAGA CCTTGCAATG GACGACCTTC GATTTCTTCG 



TGAAAACATC ACCACTGGTT GTGCTGAACA CTGTTCTTTG AACGAAAACA 
ACTTTTGTAG TGGTGACCAA CACGACTTGT GACAAGAAAC TTGCTTTTGT 



10 TTACGGTACC AGACACCAAG GTTAACTTCT ' AC^-'TTGGAA ACGTATGGAA 
AATGCCATGG TCTGTGGTTC CAATTGAAGA TGCoAACCTT TGCATACCTT 



GTTGGTCAAC AAGCAGTTGA AGTTTGGCAG GGTCTGGCAC TGCTGAGCGA 
CAACCAGTTG TTCGTCAACT TCAAACCGTC CCAGACCGTG ACGACTCGCT 



GGCTGTACTG CGTGGCCAGG CACTGCTC^^T AAACTCCTCT CAGCCGTGGG 

CCGACATGAC GCACCGGTCC GTGACG.-C.A TTTGAGGAGA GTCGGCACCC 

15 

AACCGCTGCA GCTGCATGTT GACAAAGCAG TATCTGGCCT GAGATCTCTG 

TTGGCGACGT CGACGTACAA CTGTTTCGTC ATAGACCGGA CTCTAGAGAC 



ACTACTCTGC TGCGTGCTCT GGGTGCACAG AAAGAGGCTA TCTCTCCGCC 
TGATGAGACG ACGCACGAGA CCCACGTGTC TTTCTCCGAT AGAGAGGCGG 



20 GGATGCTGCA TCTGCTGCAC CGCTGCGTAC CATCACTGCT GATACCTTCC 
CCTACGACGT AGACGACGTG GCGACGCATG GTAGTGACGA CTATGGAAGG 



GCAAACTGTT TCGTGTATAC TCTAACTTCC TGCGTGGTAA ACTGAAACTG 
CGTTTGACAA AGCACATATG AGATTGAAGG ACGCACCATT TGACTTTGAC 

Sai l 

TATACTGGCG AAGCATGCCG TACTGGTGAC CGCTAATAG 
ATATGACCGC TTCGTACGGC ATGACCACTG GCGATTATCA GCT 

25 
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More particularly, Table VIII illustrates oligo- 
nucleotides employed to generate the Section 1 of the 
ECEPO gene encoding amino terminal residues of the human 
species polypeptide. Oligonucleotides were assembled 
5 into duplexes Cl. and 2^ 3 and etc.] and the duplexes 
were then ligated to provide ECEPO Section 1 as in Table 
IX. Note that the assembled section includes respective 
terminal Eco RI and Bam Hl sticky ends, that "downstream" 
of the Eco RI sticky end is a Xba l restriction enzyme 

10 recognition site; and that "upstream" of the BamHl sticky 
end is a Kpn l recognition site. Section 1 could readily 
be amplified using the M13 phage vector employed for 
verification of sequence of the section. Some dif- 
ficulties were encountered in isolating the section as an 

15 Xba l/ Kpn l fragment from RF ONA generated in E .coli , 

likely due to methylation of the Kpn l recognition site 
bases within the host. Single-stranded phage DNA was 
therefore isolated and rendered into double-stranded form 
in vitro by primer extension and the desired double- 

20 stranded fragment was thereafter readily isolated. 

ECEPO gene Sections 2 and 3 (Tables XI and XIII) 
were constructed in a similar manner from the oligo- 
nucleotides of Tables X and XII, respectively. Each 
section was amolified in the M13 vector employed for 

25 sequence verification and was isolated from phage ONA. 
As is apparent from Table XI, ECEPO Section 2 was con- 
structed with Eco RI and Bam Hl sticky ends and could be 
isolated as a Kon l/ Bgl ll fragment. Similarly, ECEPO 
Section 3 was prepared with Bam Hl and Sai l sticky ends 

30 and could be isolated from phage RF ONA as a Bgl ll/ Sal l 
fragment. The three sections thus prepared can readily 
be assembled into a continuous ONA sequence (Table XIV) 
encoding the entire human species EPO polypeptide with an 
amino terminal methionine codon (ATG) for E .coli transla- 

35 tion initiation. Note also that "upstream" of the ini- 
tial ATG is a series of base pairs substantially 
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duolicating the ribosome binding site sequence of the 
highly expressed OMP-f gene of E . col i > 

Any suitable expression vector may be employed 
to carry the ECEPO. The particular vector chosen for 
5 expression of the ECEPO gene as the "temperature sen- 
sitive" plasmid pCFM536 — a derivative of plasmid 
pCFM414 (A,T.C.C« 40076) as described in co-pending 
U.S. Patent Application Serial No. 636,727, filed August 
6, 1984, by Charles F. Morris. More specifically, 

10 pCFM536 was digested with Xba l and Hind III ; the large 

fragment was isolated and employed in a two-part ligation 
with the ECEPO gene. Sections 1 (Xbal/Kpnl), 2 
(KpnI/Balll ) and 3 (Bglll/Sall) had" creviously been 
assembled in the correct order in Ml3 and the EPO gene 

15 was isolated therefrom as a single Xba l/ Hind lll fragment. 
This fragment included a portion of the polylinker from 
M13 mD9 phage spanning the Sai l to Hind lll sites therein. 
Control of expression in the r/sulting expression 
plasmid, p536 , was by means of a lambda promoter, 

20 which itself may be under control of the ^jq^j repressor 
gene (such as provided in E . coli strain K12AHtrp]. 

The manufactured ECEPO gene above may be 
variously modified to encode erythropoietin analogs such 
as [Asn^, des-Pro^ through Ile^]hEPO and [His'^lhEPO, as 

25 described below. 

A . [Asn^, des-Pro^ through Ile^l h£PO 

Plasmid 536 carrying the ECEPO manufactured gene 
of Table XIV as a Xba l to Hind lll insert was digested 
30 with Hind lll and Xho l . The latter endcnuclease cuts the 
ECEPO gene at a unique, 6 base pair recognition site 
spanning the last base of the codon encoding Asp through 
the second base of the Arg'''^ codon. A Xba l/ Xho l "linker" 
sequence was manufactured having the following sequence: 

35 
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Xbal 

Met 

S'-CTAG ATG 
3' -TAG 



+ 12 7 

Ala Asn Cys 

GCT AAT TGC 

CGA TTA ACG 



8 9 
Asp Xho l 
GAC-3' 
CTG AGCT-5' 



The Xbal/Xhol linker and the Xhol/ Hind lll ECEPO 
5 gene sequence fragment were inserted into the large 
fragment resulting from Xba l and Hind lll digestion of 
plasmid pCFM526 a derivative of plasmid pCFMAl4 
CA.T.C.C. 40076) as described in co-pending 
U.S. Patent Application Serial No. 636,727, filed August 
10 6, 1984, by'charles F. Morris, to generate a plasmid- 
borne DNA sequence encoding E . coli expression of the 
Met*"^ form of the desired analog. 

B. ['His^l hEPO 

15 Plasmid 536 was digested with Hind lll and Xhol 

as in part A above. A Xba l/ Xho l linker was manufactured 
having the following sequence: 



Xba l +12 3 4 

Met Ala Pro Pro Arg 

20 S'-CTAG ATG GCT CCG CCA CGT 

3' -TAG CGA GGC GGT GCA 



5 6 7 8 9 Xhol 
Leu He His Asp 
CTG ATG CAT GAC.3' 
GAC TAG GTA CTG AGCT-5' 



The linker and the Xho l/ Hind lll ECEPO sequence 
fragment were then inserted into pCFM526 to generate a 
plasmid-borne DNA sequence encoding E.coli expression of 

25 the Met"''' form of the desired analog. 

Construction of a manufactured gene (*'SC£PO'') 
incorporating yeast preference codons is as described in 
the following Tables XV through XXI. As was the case 
with the ECEPO gene, the entire construction involved 

30 formation of three sets of oligonucleotides [Tables XV, 
XVII and XIX) which were formed into duplexes and 
assembled into sections (Tables XVI, XVIII and XX). Note 
that synthesis was facilitated in part by use of some 
sub-optimal codons in both the SCEPO and ECEPO construe- 
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ticns, i.e., oligonucleotides 7-12 of Section 1 of both 
genes were identical, as were oligonucleotides 1-6 cf 
Section 2 in each gene. 



10 



15 



20 



25 



30 
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TABLE XV 

SCEPO SECTION 1 0LIG0NUCLS0TICE5 

1. AATTCAAGCTTGGATAAAAGAGCT 

5 2. GTGGAGCTCTTTTATCCAAGCTTG 

3. CCACCAAGATTGATCTGTGACTC 

4. TCTCGAGTCACAGATCAATCTTG 

5. GAGAGTTTTGGAAAGATACTTGTTG 

6. CTTCCAACAAGTATCTTTCCAAAAC 
10 7. GAAGCTAAAGAAGCTGAAAACATC 

8. GTGGTGATGTTTTCAGCTTCTTTAG 

9. ACCACTGGTTGTGCTGAACACTGTTC 

10. CAAAGAACAGTGTTCAGCACAACCA 

11. TTTGAACGAAAACATTACGGTACCG 
15 12. GATCCGGTACCGTAATGTTTTCGTT 

TABLE XVI 
SCEPO SECTION 1 

EcoRI Hind II I !_ 
AATTCA AGCTTGGATA 
20 GT TCGAACCTAT 

2 

AAAGAGCT bc ACC AAgItTG ATCTGTGACT c|gAGAGTTTT 
TTTCTCGAGG TGGTTCTAAC TAGACACTGA GCTCTjCAAAA 

4 



5 , 7 

AAAGATAC TTGTTGEAAG CTAAAGAAGC TGAAAACATC I^CCACTGGTT 
TTTCTATG AACAACCTTc] GATTTCTTCG ACTTTTGTAG TGGTGjACCAA 



5 

GGA/ 
25 CC- 

6 ' 8 

9 U, Kon I Bam HI 

GTGCTGAACA CTGTTc ItTTG AACGAAAACA TTACGGTACC G 

CACGACTTGT GACAAGAAACI TTGCTTTTGT AATGCCATGG CCTAG 

12 
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tasl; XVII 

SC£?0 SECTION 2 OLIGONUCLEOTIDES 

I. AATTCGGTACCAGACACCAAGGT 
5 2. GTTAACCTTGGTGTCTGGTACCG 

3. TAACTTCTACGCTTGGAAACGTAT 

4. TTCCATACGTTTCCAAGCGTAGAA 

5. GGAAGTTGGTCAACAAGCAGTTGAAGT 

6. CCAAACTTCAACTGCTTGTTGACCAAC 
10 7. TTGGCAAGGTTTGGCCTTGTTATCTG 

8. GCTTCAGATAACAAGGCCAAACCTTG 

9. AAGCTGTTTTGAGAGGTCAAGCCT 
10. AACAAGGCTTGACCTCTCAAAACA 

II. TGTTGGTTAACTCTTCTCAACCATGGG 

15 12. TGGTTCCCATGGTTGAGAAGAGTTAACC - 

13. AACCATTGCAATTGCACGTCGAT 

14. CTTTATCGACGTGCAATTGCAA 

15. AAAGCCGTCTCTGGTTTGAGATCTG 

16. GATCCAGATCTCAAACCAGAGACGG 

20 



25 
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TABLE XVIII 
SCEPC SECTION 2 



. Kpn l 
EcoRI 1 

FTTTCGGTACC AGACACCAAG 
■ 5 GCCATGG TCTGTGGTTC 

2 

gt Itaact tct ACGCTTGGAA ACGTATGGAA GTTGGTCAAC AAGCTGTTGA 
CAATTG|\AGA TGCGAACCTT TGCATACTfTj CAACCAGTTG TTCGACAACT 

i i 

AGTtTTGGCAA GGTTTGGCCT TGTTATCTG R AGCT GTTTTG AGAGGTCAAG 
10 TCAAA^CfeTT CCAAACCGGA ACAATAGACT TCGIACAAAAC TCTCCAGTTC 

8 10 

11 12 

CCT[rGTTGGT TAACTCTTCT CAACCATGGG hACCAT TGCA ATTGCACGTC 

GGAACAaICCA ATTGAGAAGA GTTGGTACCC TTGGTftACGT TAACGTGCAG 
'12 ii 

15 BolII BamHI 

GAT^AAGCCG TCTCTGGTTT GAGATCTG 
15 CTATTTcpGC AGAGACCAAA CTCTAGACCTA G 

16 



20 



25 
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TABLE XIX 

SCEPO SECTION 3 OLIGONUCLEOTIDES 

1. GATCCAGATCTTTGACTACTTTGTT 

5 2. TCTCAACAAAGTAGTCAAAGATCTG 

3. GAGAGCTTTGGGTGCTCAAAAGGAAG 

4. ATGGCTTCCTTTTGAGCACCCAAAGC 

5. CCATTTCCCCACCAGACGCTGCTT 

6. GCAGAAGCAGCGTCTGGTGGGGAA 
10 7. CTGCCGCTCCATTGAGAACCATC 

8. CAGTGATGGTTCTCAATGGAGCG 

9. ACTGCTGATACCTTCAGAAAGTT 

10. GAATAACTTTCTGAAGGTATCAG 

11. ATTCAGAGTTTACTCCAACTTCT 
15 12. CTCAAGAAGTTGGAGTAAACTCT 

13. TGAGAGGTAAATTGAAGTTGTACAC 

14. ACCGGTGTACAACTTCAATTTACCT 

15. CGGTGAAGCCTGTAGAACTGGT 

16. CTGTCACCAGTTCTACAGGCTTC 
20 17. GACAGATAAGCCCGACTGATAA 

18. GTTGTTATCAGTCGGGCTTAT 

19. CAACAGTGTAGATGTAACAAAG 

20. TCGACTTTGTTACATCTACACT 



25 
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TABLE XX 
SC£PO SECTION 3 



Bam HI Balll 1 , 

gaTc cagaTctttg actactttgt tcagagcttt 
gtctagaaac tgatgaaaca actctpgaaa 

5 • .2 

3 5 

GGGTGCTCAA AAGGAAC fcCA TT TCCCCACC AGACGCTGCT T CTGCC GCTC 

CCCACGAGTT TTCCTTCGGT~Ap^AGGGGTGG TCTGCGACGA ASAC^GCGAG 
4 i 

7 i ^ U • 

CATTGAGAAC CATChCTGCT GATACCTTCA GAAAGTT hTT CA GAGTTTAC 

GTAACTCTTG GTAGTSaSgA CTATGGAAGT CTTTCAATAA G|rCTCAAATG 

10 8 ^ 10 . 12 

TCCAACTTCT |TGAG AGGTAA~ATTGAAGTTG TACACCGGTC AAGCCTGTAG 
AGGTTGAAGA ^^TaTCCATT TAACTTCAAC ATGTGGCC^ TTCGGACATC 
'14^ 16 • 

17 ,19 

AACTGGTfcACAGATAAGCCC GACTGATAA C AACA GTGTAG 

TTGACCAfTtnTfrATTCGGG CTGACTATTG TTG(TCACATC 
15 18 

Sai l 

ATGTAACAAA G 
TACATTGTTT CAGCT 
20 



20 



25 
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TABLE XXI 



SCEPO GENE 



-1 +1 



Hindi 1 1 
AGCTTGGATA AA 
ACCTAT TT 




5 



GGAAAGATAC TTGTTGGAAG CTAAAGAAGC TGAAAACATC ACCACTGGTT 
CCTTTCTATG AACAACCTTC GATTTCTTCG ACTTTTGTAG TGGTGACCAA 



GTGCTGAACA CTGTTCTTTG AACGAAAACA TTACGGTACC AGACACCAAG 
CACGACTTGT GACAAGAAAC TTGCTTTTGT AATGCCATGG TCTGTGGTTC 



10 GTTAACTTCT ACGCTTGGAA ACGTATGGAA GTTGGTCAAC AAGCTGTTGA 
CAATTGAAGA TGCGAACCTT TGCATACCTT CAACCAGTTG TTCGACAACT 



AGTTTGGCAA GGTTTGGCCT TGTTATCTGA AGCTGTTTTG AGAGGTCAAG 
TCAAACCGTT CCAAACCGGA ACAATAGACT TCGACAAAAC TCTCCAGTTC 



CCTTGTTGGT TAACTCTTCT CAACCATGGG AACCATTGCA ATTGCACGTC 
GGAACAACCA ATTGAGAAGA GTTGGTACCC TTGGTAACGT TAACGTGCAG 



GATAAAGCCG TCTCTGGTTT GAGATCTTTG ACTACTTTGT TGAGAGCTTT 
CTATTTCGGC AGAGACCAAA CTCTAGAAAC TGATGAAACA ACTCTCGAAA 



GGGTGCTCAA AAGGAAGCCA TTTCCCCACC AGACGCTGCT TCTGCCGCTC 
CCCACGAGTT TTCCTTCGGT AAAGGGGTGG TCTGCGACGA AGACGGCGAG 



20 CATTGAGAAC CATCACTGCT GATACCTTCA GAAAGTTATT CAGAGTTTAC 
GTAACTCTTG GTAGTGACGA CTATGGAAGT CTTTCAATAA GTCTCAAATG 



TCCAACTTCT TGAGAGGTAA ATTGAAGTTG TACACCGGTG AAGCCTGTAG 
AGGTTGAAGA ACTCTCCATT TAACTTCAAC ATGTGGCCAC TTCGGACATC 



AACTGGTGAC AGATAAGCCC GACTGATAAC AACAGTGTAG 
TTGACCACTG TCTATTCGGG CTGACTATTG TTGTCACATC 



15 



25 



Sai l 

ATGTAACAAA G 
TACATTGTTT CAGCT 
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The assembled SCEPO sections were sequenced in 



M13 and Sections 1, 2 and 3 were isolatabLe from the 
phage as Hind lll/Konl , Kpnl/Bqlll, an.d 3qlII/5alI frag- 
ments • 



► SCEPO gene products is a secretion system based on 
S >cerevisiae d-factor secretion, as described in co- 
pending U.S. Patent Application Serial No. 487,753, filed 
April 22, 1983, by Grant A. Bitter, published October 31, 
10 1984 as European Patent Application 0 123,294. Briefly 
put, the system involves constructions wherein ONA 
encoding the leader sequence of the yeast a-factor gene 
product is positioned immediately 5* to the coding region 
of the exogenous gene to be expressed. As a result, the 
15 gene product translated includes a leader or signal 

sequence which is "processed off" by an endogenous yeast- 
enzyme in the course of secretion of the remainder of the 
product. Because the construction makes use of the a- 
factor translation initiation (ATG) codon, there was no 
20 need to provide such a codon at the -1 position of the 
SCEPO gene. As may be noted from Table XXI, the alanine 
(+1) encoding sequence is preceded by a linker sequence 
allowing for direct insertion into a plasmid including 
the ONA for the first 80 residues of the a-factor leader 
25 following the a-factor promoter. The specific preferred 
construction for SCEPO gene • expression involved a four- 
part ligation including the above-noted SCEPO section 
fragments and the large fragment of Hind lll/ Sal l 
digestion of plasmid paC3. From the resulting plasmid 
30 paC3/SC£?0, the a-factor promoter and leader sequence and 
SCEPO gene were isolated by digestion with BamHI and 
ligated into Bam HI digested plasmid pYE to form 
expression plasmid pYE/SCEPO. 

35 EXAMPLE 12 



5 



The presently preferred expression system for 



The present example relates to expression of 
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recombinant products of the manufactured ECcPO and SCEPO 
genes within the expression systems of Example 11. 

In use of the expression system designed for use 
of £ .coll host cells, plasmid p536 of Example 11 was 
5 transformed into AM7 E .coll cells previously transformed 
with a suitable plasmid, pMWl, harboring a Cjg^y gene. 
Cultures of cells in LB broth (Ampicillin 50 ug/f^l and 
kanamycin 5 yg/ml, preferably with 10 mM MgSO^) were 
maintained at 28'C and upon growth of cells in culture to 

10 0.0. = 0.1, EPO expression was induced by raising the 
culture temperature to 42*C. Cells grown to about 40 
0.0. provided EPO production (as estimated by gel) of 
about 5 mg/OD liter. 

Cells were harvested, lysed, broken with French 

15 Press [10,000 psi) and treated with" lysozyme and NP-40 

detergent. The pellet resulting from 24,000 xg centrifu- 
gation was solubilized with guanidine HCl and subjected 
to further purification in a single step by means of 
C^ (Vydac) Reverse Phase HPlC CEtOH, 0-8035, 50 mM NH^Ac, 

20 pH 4.5). Protein sequencing revealed the product to be 
greater than 93% pure and the products obtained revealed 
two different amino terminals, A-P-P-R... and P-P-R... in 
a relative quantitative ratio of about 3 to 1. This 
latter observation of hEPO and [des Ala^lhEPO products 

25 indicates that amino terminal "processing*" within the 
host cells serves to remove the terminal methionine and 
in some instances the initial alanine. Radioimmunoassay 
activity for the isolates was at a level of 150,000 to 
160,000 U/mg; iji vitro assay activity was at a level of 

30 30,000 to 62,000 U/mg; and in vivo assay activity ranged 
from about 120 to 720 U/mg. (Cf., human urinary isolate 
standard of 70,000 U/mg in each assay.) The dose response 
curve for the recombinant product in the In vivo assay 
differed markedly from that of the human urinary EPO 

35 standard. 
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The EPO analog plasmids formed in parts A and B 
of Examole 11 were each transformed into pMWl- trans formed 
AM7 E , col i cells and the cells were cultured as above. 
Purified isolates were tested in both RIA and ijn vitro 
5 assays. RIA and i£ vitro assay values for [Asn^, 
des-Pro^ through hEPO expression products were 

approximately • 11 ,000 U/mg and 6,000 U/mg protein, respec- 
tively, while the assay values for [His^jhEPO were about 
41,000 U/mg and 1A,000 U/mg protein, respectively, indi- 

10 eating that the analog products were from one-fourth to 
one-tenth as "active" as the "parent" expression product 
in the assays. 

In the expression system designed for use of 
S .cerevisiae host cells, plasmid pYE/SCEPO was trans- 

15 formed into two different strains, YSDP4 (genotype a 
peD4-3 trpl ) and RK81 (genotype aa pep4-3 trol ) > 
Transformed YSDP4 hosts were grown in SO medium (Methods ' 
in Yeast Genetics, Cold Spring Harbor Laboratory, Cold 
Spring Harbor, N.Y., p. 62 (1983) supplemented with casa- 

20 mino acids at 0.3%, pH 6.5 at 30'C. Media harvested when 
the cells had been grown to 36 0.0. contained EPO pro- 
ducts at levels of about 244 U/ml (97 ug/OD liter by 
RIA). Transformed RK81 cells grown to either 6.5 0.0. or 
60 0.0. provided media with EPO concentrations of about 

25 80-90 U/ml (34 pg/OD liter by RIA). Preliminary analyses 
reveal significant heterogeneity in products produced by 
the expression system, likely to be due to variations in 
glycosylation of proteins expressed, and relatively high 
mannose content of the associated carbohydrate. 

30 Plasmids PaC3 and pYE in HBlOl E .coli cells were 

deposited in accordance with the Rules of Practice of the 
U.S. Patent Office on September 27, 1984, with the 
American Type Culture Collection, 12301 Parklawn Drive, 
Rockville, Maryland, under deposit numbers A.T.C.C. 39881 

35 and A.T.C.C. 39882, respectively. Plasmids pCFM526 in 
AM7 cells, pCFM536 in JM103 cells, and pMWl in JM103 
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cells were likewise deposited on November 21, 1984 as 
A.T.C-C. 33932, 3393A, and 33933, respectively. 
Saccharomyces cer e v is iae strains YSPOA and RK31 were 
deposited on iNovember 21, 1984 as A,T,C-C, 20734 and 
5 20733, respectively. 

It should be readily apparent from consideration 
of the above illustrative examples that numerous excep- 
tionally valuable products and processes are provided .by 
the present invention in its many aspects. 

10 Polypeptides provided by the invention are 

conspicuously useful materials, whether they are micro- 
bially expressed products or synthetic products, the pri- 
mary, secondary or tertiary structural conformation of 
which was first made known by the present invention. 

15 As previously indicated, recomdinant-pr educed 

and synthetic products of the invention share, to varying 
degrees, the iji vitro biological activity of EPO isolates 
from natural sources and consequently are projected to 
have utility as substitutes for EPO isolates in culture 

20 media employed for growth of erythropoietic cells in 

culture. Similarly, to the extent that polypeptide pro- 
ducts of the invention share the in^ vivo activity of 
natural EPO isolates they are conspicuously suitable for 
use in erythropoietin therapy procedures practiced on 

25 mammals, including humans,, to develop any or all of the 
effects herefore attributed in vivo to EPO, e.g., stimu- 
lation of reticulocyte response, development of ferroki- 
netic effects (such as plasma iron turnover effects and 
marrow transit time effects), erythrocyte mass changes, 

30 stimulation of hemoglobin C synthesis (see, Eschbach, et 
al., supra ) and, as indicated in Example 10, increasing 
hematocrit levels in mammals. Included within the class 
of humans treatable with products of the invention are 
patients generally requiring blood transfusions and 

35 including trauma victims, surgical patients, renal 
disease patients including dialysis patients, and 
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patients with a variety of blood composition affecting 
disorders, such as hemophilia, sickle cell disease, phy- 
siologic anemias, and the like. The minimization of the 
need for transfusion therapy through use" of EPO therapy 
5 can be expected to result in reduced transmission of 

infectious agents. Products* of the invention, by virtue 
of their production by recombinant methods, are expected 
to be free of pyrogens, natural inhibitory substances, 
and the like, and are thus likely to provide enhanced 

10 overall effectiveness in therapeutic processes vis-a-vis 
naturally derived products. Erythropoietin therapy with 
products of the present invention is also expected to be 
useful in the enhancement of oxygen carrying capacity of 
individuals encountering hypoxic environmental conditions 

15 and possibly in providing beneficial cardiovascular 
effects. 

A preferred method for administration of poly- 
peptide products of the invention is by parenteral (e.g., 
IV, IM, SC, or IP) routes and the compositions admi- 

20 nistered would ordinarily include therapeutically 

effective amounts of product in combination with accep- 
table diluents, carriers and/or adjuvants. Preliminary 
pharmacokinetic studies indicate a longer half-life in 
vivo for monkey EPO products when administered IM rather 
.25 than IV. Effective dosages are expected .to vary substan- 
tially depending upon the cohdition treated but thera- 
peutic doses are presently expected to be in the range of 
0.1 ('^7U) to 100 (-^700011) ug/kg body weight of the active 
material. Standard diluents such as human serum albumin 

30 are contemplated for pharmaceutical compositions of the 
invention, as are standard carriers such as saline. 

Adjuvant materials suitable for use in com- 
positions of the invention include compounds indepen- 
dently noted for erythropoietic stimulatory effects, such 

35 as testosterones , progenitor cell stimulators, 

insulin-like growth factor, prostaglandins, serotonin, 
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cyclic AMP, prolactin and triiodothyronine, as well as 
agents generally employed in treatment of aplastic ane- 
mia, such as methenolene, stanozolol and nandrolone [see, 
e.g., Resegotti, et al., Panminerva Medica , 23 , , 243-248 
5 (1981]; McGonigle, et al., Kidney Int > , 25(2) , 437-444 
(1984); Paviovic-Kantera, et al., Expt .Hematol . , 8(5udo . 
82, 283-291 (1980); and Kurtz, FEBS Letters, I4a(l) , 
105-108 (1982)]. Also contemplated as adjuvants are 
substances reported to enhance the effects of, or 

10 synergize, erythropoietin or asialo-EPO, such as the 

adrenergic agonists, thyroid hormones, androgens and 3PA 
[see, Dunn, "Current Concepts in Erythropoiesis" , John 
Wiley and Sons (Chichester, England, 1983); Weiland, et 
al., Blut, 44(3) , 173-175 (1982); Kalmanti, Kidney Int . , 

15 22, 383-391 (1982); Shahidi, New .Enq . J .Med . , 289, 72-80 
( 1973); Fisher, et al. , Steroids , 30( 6) , 833-845 (1977); 
Urabe, et al . , 3. Exp, Med. , 149, 1314-1325 (1979); and 
Billat, et al., Expt . Hema tol . , 10(1) , 133-140 (1982)] as 
well as the classes of compounds designated hepat ic 

20 erythropoietic factors" [see, Naughton, et al., 

Acta.Haemat . , 69, 171-179 (1983)] and " erythrotropins" 
[as described by Congote, et al. in Abstract 364, 
Proceedings 7th International Congress of Endocrinology 
(Quebec City, Quebec, July 1-7, 1984); Congote, 

25 Biochem.BioDhys. Res. Comm. , 115( 2) , 447-483 (1983) and 
Congote, Anal . Biochem . , 140 , 428-433 (1984)] and 
" er ythrogenins" [as described in Rothman, et al., 
J.Suro.Oncol. , 20, 105-108 (1982)]- Preliminary 
screenings designed to measure erythropoietic responses 

30 of ex-hypoxic polycythemic mice pre-treated with either 
5-a-dihydrotestosterone or nandrolone and then given 
erythropoietin of the present invention have generated 
equivocal results • 

Diagnostic uses of polypeptides of the invention 

35 are similarly extensive and include use in labelled and 
unlablled forms in a variety of immunoassay techniques 
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including RIA's, ELISA's and the like, as well as a 
variety of in vitro and iji vivo activity assays. See, 
e-g., Dunn, et al., Expt .Hematol . , 11(7) , 590-6C0 (1983); 
Gibson, et al., Pathology , 16, 155-156 C1984); Krystal', 
5 ExDt .Hematol, , 11(7) , 649-660 (1983); Saito, et al., 
Jao.J.Med, , 23(1 ) , 16-21 (1984); Nathan, et al.. 
New Eng. J, Med: , 308(9) , 520-522 C 1983); and various 
references pertaining to assays referred to therein. 
Polypeptides of the invention, including synthetic pep- 

10 tides corrprising sequences of residues of EPO first 
revealed herein, also provide highly useful pure 
materials for generating polyclonal antibodies and 
"banks" of monoclonal antibodies specific for differing 
continuous and discontinuous epitopes of EPO. As one 

15 example, preliminary analysis of the amino acid sequences 
of Table VI in the context of hydropathici t y according to 
Hopp, et al., P.N .A. 5. (U.S ,A , ) , 78, pp. 3824-3828 
(1931) and of secondary structures according to Chou, et 
al., Ann .Rev . aiochem . , 47 , p. 251 (1978) revealed that 

20 synthetic peptides duplicative of continuous sequences of 
residues spanning positions 41-57 inclusive, 116-118 
inclusive and 144-166 inclusive are likely to produce a 
highly antigenic response and generate useful monoclonal 
and polyclonal antibodies immunoreac t ive with both the 

25 synthetic peptide and the entire protein. Such antibo- 
dies are expected to be useful in the detection and affi- 
nity purification of EPO and EPO-related products. 

Illustratively, the following three synthetic 
peptides were prepared: 

30 

(1) hEPO 41-57, V-P-O-T-K-V-N-F-Y-A-W-K- 

R-M-E-V-G; 

(2) hEPO 116-128, K-E-A-I-S-P-P-O-A-A-S-A-A; 

(3) hEPO 144-166, V-Y-S-N-F-L-R-G-K-L-K-L-Y- 
35 T-G-E-A-C-R-T-G-O-R. 
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Preliminary immunization studies employing the above- 
noted poiyoeptides have revealed a relatively weak posi- 
tive response to hEPO 41-57, no appreciable response to 
hEPO 116-128, and a strong positive resopnse to hEPO 
5 144-166, as measured by capacity of rabbit serum antibo- 
dies to immunoprecipitate ^^^I-labelled human urinary EPO 
isolates. Preliminary in vivo activity studies on the 
three peptides revealed no significant activity either 
alone or in combination. 

10 While the deduced sequences of amino acid resi- 

dues of mammalian- EPO provided by the illustrative 
examples essentially define the primary structural con- 
formation of mature EPO, it will be understood that the 
specific sequence of 165 amino acid residues of monkey 

15 species EPO in Table V and the 166 residues of human spe- 
cies EPO in Table VI do not limit the scope of useful 
polypeptides provided by the invention. Comorehended by 
the present invention are those various naturally- 
occurring allelic forms of EPO which past research into 

20 biologically active mammalian polypeptides such as human 
Y interferon indicates are likely to exist. (Compare, - 
e.g., the human immune interferon species reported to 
have an arginine residue at position No. 140 in EPO 
published application 0 077 670 and the species reported 

25 to have glutamine at position No. 140 in Gray, et al., 
Nature , 295 , pp. 503-508 (1982). Both species are 
characterized as constituting "mature" human y interferon 
sequences.) Allelic forms of mature EPO polypeptides may 
vary from each other and from the sequences of Tables V 

30 and VI in terms of length of sequence and/or in terms of 
deletions, substitutions, insertions or additions of 
amino acids in the sequence, with consequent potential 
variations in the capacity for glycosylat ion . As noted 
previously, one putative allelic form of human species 

35 EPO is believed to include a methionine residue at posi- 
tion 126. Expectedly, naturally-occurring allelic forms 
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of EPO-encoding ONA genomic and cDNA sequences are also 
likely to occur which code for the above-noted types of 
allelic polypeptides or simply employ differing codons 
for designation of the same polypeptides as specified. 
5 In addition to naturally-occurring allelic forms 

of mature EPO, the present invention also embraces other 
"EPO products" such as polypeptide analogs of EPO and 
fragments of "mature" EPO. Following the procedures of 
the above-noted published application by Alton, et al. 
10 CwO/83/04053) one may readily design and manufacture 
genes coding for microbial expression of polypeptides 
having primary conformations which differ from that 
herein specified for mature EPO in terms of the identity 
or location of one or more residues (e.g., substitutions, 
15 terminal and intermediate additions and deletions). 

Alternately, modifications of cONA and genomic EPO genes 
may be readily accomplished by well-known site-directed 
mutagenesis techniques and employed to generate analogs 
and derivatives of EPO. Such EPO products would share at 
20 least one of the biological properties of EPO but may 

differ in others. As examples, projected EPO products of 
the invention include those which are foreshortened by 
e.g., deletions [Asn^, des-Pro^ through Ile^]hEPO, 
[des-Thr^^^ through Arg^^^]hEPO and " A27-55hEP0" , the 
25 latter having the residues coded for by an entire exon 
deleted; or which are more stable to hydrolysis (and, 
therefore, may have more pronounced or longer lasting 
effects than naturally-occurring EPO); or which have been 
altered to delete one or more a potential sites for gly- 
30 cosylation (which may result in higher activities for 
yeast-produced products); or which have one or more 
cystein residues deleted or replaced by, e.g., histidine 
or serine residues (such as the analog [His^]hEPO) and 
are potentially more easily isolated in active form from 
35 microbial systems; or which have one or more tyrosine 
residues replaced by phenylalanine (such as the analogs 
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[phe-^lhEPO, [Phs^^lhEPO, and [Phe^^^l hEPO ) and may bind 
more or less readily to £P0 receptors on target cells. 
Also comprehended are polypeptide fragments duplicating . 
only a part of the continuous amino acid sequence or 
5 secondary conformations within mature EPO, which 

fragments may possess one activity of EPO (e.g., receptor 
binding) and not others (e.g., erythropoietic activity). 
Especially significant in this regard are those potential 
fragments of EPO which are elucidated upon consideration 
10 of the human genomic ONA sequence of Table VI, i.e., 
"fragments" of the total continuous EPO sequence which 
are delineated by intron sequences and which may consti- 
tute distinct "domains" of biological activity. It is 
noteworthy that the absence of in vivo activity for any 
15 one or more of the "EPO products" of the invention is not 
wholly preclusive of therapeutic utility (see, Weiland, 
et al., suora ) or of utility in other contexts, such as 
in EPO assays or EPO antagonism. Antagonists of erythro- 
poietin may be auite useful in treatment of polycythemias 
20 or cases of overproouction of EPO [see, e . g . , Adamson , 
Hoso. Practi ce, 18(12) . 49-57 (1983), and Hellmann, et 
al., Clin. Lab. Haemat. , 5, 335-342 (1983)1. 

According to another asoect of the present 
invention, the cloned ONA sequences described herein 
25 which encode human and monkey EPO polypeptides are 

conspicuously valuable for the information which they 
provice concerning the amino acid seauence of mammalian 
erythropoietin which has heretofore been unavailable 
despite decades of analytical processing of isolates of 
30 naturally-occurring products. The ONA sequences are also 
conspicuously valuable as products useful in effecting 
the large scale microbial synthesis of erthropoietin by a 
variety of recombinant techniques. Put another way, ONA 
sequences provided by the invention are useful in 
35 generating new and useful viral and circular plasmid DNA 
vectors, new and useful transformed and trans^'ected 
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microbial procaryotic and eucaryotic host cells 
(including bacterial and yeast cells and mammalian cells 
grown in cul tur e ] , ■ and new and useful methods for 
cultured growth of such microbial host cells capable of 
5 expression of EPO and EPO products., ONA sequences of the 
invention are also conspicuously suitable materials for 
use as labelled probes in isolating EPO and related pro- 
tein encoding cONA and genomic DNA sequences of mammalian 
species other than human and monkey species herein speci- 

10 fically illustrated. The extent to which DNA sequences 
of the invention will have use in various alternative 
methods of protein synthesis [e.g., in insect cells) or 
in genetic therapy in humans and other mammals cannot yet 
be calculated, DNA sequences of the invention are 

15 expected to be useful in developing transgenic mammalian 
species which may serve as eucaryotic *'hosts*' for produc- 
tion of erythropoietin and erythropoietin products in 
quantity. See, generally, Palmiter, et al . , Science , 
222(^625) , 809-814 (1983). 

20 Viewed in this light, therefore, the specific 

disclosures of the illustrative examples are clearly not 
intended to be limiting upon the scope of the present 
invention and numerous modifications and variations are 
expected to occur to those skilled in the art. As one 

25 example, while DNA sequences provided by the illustrative 
examples include cONA and genomic DNA sequences, because 
this application provides amino acid sequence information 
essential to manufacture of DNA sequence, the invention 
also comprehends such manufactured DNA sequences as may 

30 be constructed based on knowledge of EPO amino acid 

sequences. These may code for EPO (as in Example 12) as 
well as for EPO fragments and EPO polypeptide analogs 
(i.e., "EPO Products") which may share one or more biolo- 
gical properties of naturally-occurring EPO but not share 

35 others (or possess others to different degrees). 

DNA sequences provided by the present invention 
are thus seen to comprehend all DNA sequences suitable 
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for use in securing expression in a procaryotic or 
eucaryotic host cell of a polypeptide product having at 
least a part of the primary structural conformation and 
one or mcire of the biological properties of erythro- 
5 poietin, and selected from among: (a) the DNA sequences 
set out in Tables V and VI; (b) DNA sequences which 
hybridize to the DNA sequences defined in (a) or 
fragments thereof; and (c) DNA sequences which, but for 
the degeneracy of the genetic code, would hybridize to 

10 the DNA sequences defined in (a) and (b). It is 

noteworthly in this regard,- for example, that existing 
allelic monkey and human EPO gene sequences and other 
mammalian species gene sequences are expected to hybri- 
dize to the sequences of Tables V and VI or to fragments 

15 thereof. Further, but for the degeneracy of the genetic 
code, the SCEPO and ECEPO genes and the manufactured or 
mutagenized cDNA or genomic DNA sequences encoding 
various EPO fragments and analogs would also hybridize to 
the above-mentioned DNA sequences. Such hybridizations 

20 could readily be carried out under the hybridization con- 
ditions described herein with respect to the initial iso- 
lation of the monkey. and human EPO-encoding DNA or more 
stringent conditions, if desired to reduce background 
hybridization. 

25 In a like manner, while the above examples 

illustrate the invention of microbial expression of EPO 
products in the context of mammalian cell expression of 
DNA inserted in a hybrid vector of bacterial plasmid and 
viral genomic origins, a wide variety of expression 

30 systems are within the contemplation of the invention. 
Conspicuously comprehended are expression systems 
involving vectors of homogeneous origins applied to a 
variety of bacterial, yeast and mammlain cells in culture 
as well as to expression systems not involving vectors 

35 (such as calcium phosphate transfection of cells). In 
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this rsgard, it will be understood that expression of, 
e.g., monkey origin ONA in monkey host cells in culture 
and human host cells in culture, actually constitute 
instances of "exogenous" ONA expression inasmuch as the 
5 EPO ONA whose high level expression is sought would not 
have its origins in the genome of the host. Expression 
systems of the invention further contemplate these prac- 
tices resulting in cytoplasmic formation of EPO products 
and accumulation of glycosylated and non-glycosylated EPO 
10 products in host cell cytoplasm or membrances (e.g., 
accumulation in bacterial periplasmic spaces) or in 
culture medium supernatants as above illustrated, or in 
rather uncommon systems such as P .aeruginosa expression 
systems (described in Gray, et al., Biotechnology , 2, pp. 
15 161-165 (1984)). 

Improved hybridization methodologies of the 
invention, while illustratively applied above to ONA/ONA 
hybridization screenings are egually applicable to 
RNA/RNA and RNA/DNA screening. Mixed probe technigues as 
20 herein illustrated generally constitute a number of 

improvements in hybridization processes allowing for more 
rapid and reliable polynucleotide isolations. These many 
individual processing improvements include: improved 
colony transfer and maintenance procedures; use of nylon- 
25 based filters such as GeneScreen and GeneScreen Plus to 
allow reprobing with same filters and repeated use of the 
filter, application of novel protease treatments 
[compared, e.g., to Taub, et al. Anal .Biochem. , 126 , pp. 
222-230 (1982)]; use of very low individual con- 
30 centrations (on the order of 0.025 picomole) of a large 
number of mixed probes (e.g., numbers in excess of 32); 
and, performing hybridization and post-hybridization 
steps under stringent temperatures closely approaching 
(i.e., within A*c and preferably within 2'C away from) 
35 the lowest calculated dissocation temperature of any of 
the mixed probes employed. These improvements combine to 
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provide results which could not be exoected to attend 
their use. This is amply illustrated by the fact that 
mixed probe procedures involving 4 times the number of 
probes ever before reported to have been successfully 
5 used in even cDNA screens on messenger RNA species of 

relatively low abundancy were successfully applied to the 
isolation of a unique sequence gene in a genomic library 
screening of 1,500,000 phage plaques. This feat was 
accomplished essentially concurrently with the publica- 
10 tion of the considered opinion of Anderson, et al., 
supra , that mixed probe scr-eening methods were 
"...impractical for isolation of mammalian protein genes 
when corresponding RNA^s are unavailable. 

15 



20 



/ 

25 



30 
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WHAT IS CLAIMED IS: 

1. A purified and isolated polypeptide having 
part or all of the primary structural conformation and 
5 one or more of the biological properties of naturally- 
occurring erythropoietin and characterized by being the 
product of procaryotic or eucaryotic expression of an 
exogenous ONA sequence. 

10 2. A polypeptide according to claim 1 further 

characterized by being free of association with any mam- 
malian protein, 

3. A polypeptide according to claim 1 wherein 
15 the exogenous DNA sequence is a cDNA sequence. 

4. A polypeptide according to claim 1 wherein 
the exogenous DNA sequence is a manufactured DNA 
sequence . 

20 

5. A polypeptide according to claim 1 wherein 
the exogenous DNA sequence is a genomic DNA sequence. 

6. A polypeptide according to claim 1 wherein 
25 the exogenous DNA sequence is carried on an autonomously 

replicating circular DNA plasmid or viral vector. 

7. A polypeptide according to claim 1 
possessing part or all of the primary structural confer- 

30 mation of human erythropoietin as set forth in Table VI 
or any naturally occurring allelic variant thereof. 

8. A polypeptide according to claim 1 
possessing part or all of the primary structural confor- 

35 mation of monkey erythropoietin as set forth in Table V 
or any naturally occurring allelic variant thereof. 
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9. A polypeptide according to claim 1 which has 
the immunological properties of naturally-occurring 
erythropoietin. 

5 10. A polypeptide according to claim 1 which 

has the in^ vivo biological activity of naturally- 
occurring erythropoietin. 

11. A polypeptide according to claim 1 which 
10 has the i]i vitro biological activity of naturally- 
occurring erythropoietin. 

12. A polypeptide according to claim 1 further 
characterized by being covalently associated with a 

15 detectable label substance. 

13- A polypeptide according to claim 12 wherein 
said detectable label is a radiolabel. 

20 14. A DNA sequence for use in securing 

expression in a procaryotic or eucaryotic host cell of a 
polypeptide product having at least a part of the primary 
structural conformation and one or more of the biological 
properties of naturally-occurring erythropoietin, said 

25 DNA sequence selected from among: 

Ca) the DNA sequences set out in Tables V and 
VI or their complementary strands; 

(b) DMA sequences which hybridize to the DNA 
sequences defined in (a) or fragments thereof; and 

30 (c) DNA sequences which, but for the degeneracy 

of the genetic code, would hybridize to the DNA sequences 
defined in (a) and (b). 

15. A procaryotic or eucaryotic host cell 
35 transformed or transfected with a DNA sequence according 
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to claim 14 in a manner allowing the host cell to express 
said polypeptide product, 

16. A polypeptide product of the expression of 
5 a ONA sequence of claim 14 in a procaryotic or eucaryotic 

host • 

17. A purified and isolated DNA sequence coding 
for procaryotic or eucaryotic host expression of a poly- 

10 peptide having part or all of the primary structural con- 
formation and one or more of the biological properties of 
erythropoietin . 

18. A cDNA sequence according to claim 17. 

15 

19. A monkey species erythropoietin coding DNA 
sequence according to claim 18. 

20. A ONA sequence according to claim 19 and 

20 including the protein coding region set forth in Table V. 

21. A genomic ONA sequence according to claim 

17. 

25 22. A human species erythropoietin coding DNA 

sequence according to claim -21. 

23. A ONA sequence according to claim 22 and 
including the protein coding region set forth in Table. . 

30 VI. 

24. A manufactured DNA sequence according to 

claim 14. 

35 25. A manufactured ONA sequence according to 

claim 24 and including one or more codons preferred for 
expression in E . coli cells. 
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26. A manufactured ONA sequence according to 
claim 25, coding for expression of human species erythro- 
poietin. 



claim 26 including the protein coding region set forth in 
Ta.ble XIV. 

23. A manufactured ONA sequence according to 
10 claim 2A and including one or more codons preferred for 
expression in yeast cells. . 

29. A manufactured ONA sequence according to 
claim 23, coding for expression of human species erythro- 

15 poietin. 

30. A manufactured ONA sequence according to 
claim 29 including the protein coding region set forth in 
Table XXI. 

20 



5 



27. A manufactured ONA sequence according to 



31. A ONA sequence according to claim 17 cova- 
lently associated with a detectable label substance. 



32. ■ A ONA sequence according to claim 31 
25 wherein the detectable label is a radiolabel. 



33. A single-Strand ONA sequence according to 



claim 31. 



30 



3A. A DNA sequence coding for a polypeptide 
fragment or polypeptide analog of naturally-occurring 
erythropoietin . 



35 
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35. A DNA sequence coding for [Phe^^lhEPO, 
[Phe^^lhEPO, [Phe^^^JhEPO, [his^lhEPO, [Asn^ 

ies-Pro^ through Ile^JhEPO, [des-Thr ^^-^ through 
Arg^^^lhEPO, or [a27.55] hEPO . 

5 

36. A DNA sequence according to claim 34 which 
is a manufactured sequence. 

37. A biologically functional circular plasmid 
10 or viral DNA vector including a DNA sequence according to 

either of claims 14, 17, 34 or 35. 

38. A procaryotic or eucaryotic host cell 
stably transformed or transfected with a DNA vector 

15 according to claim 37. 

39. A polypeptide product of the expression in 
a procaryotic or eucaryotic host cell of a DNA, sequence 
according to claims 17 or 34. 

20 

40. A glycoprotein product having a primary 
structural conformation sufficiently duplicative of that 
of a naturally-occurring erythropoietin to allow 
possession of one or more of the biological properties 

25 thereof and having an average carbohydrate composition 
which differs from that of naturally-occurring erythro- 
poietin . 



41. A glycoprotein product having a primary 
30 structural conformation sufficiently duplicative of that 
of a naturally-occurring human erythropoietin to allow 
possession of one or more of the biological properties 
thereof and having an average carbohydrate composition 
which differs from that of naturally-occurring human 
35 erythropoietin . 
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42. Vertebrate cells which can be prooagated in 
vitro continuously and which upon growth in culture are 
capable of producing in the medium of their growth in 
excess of 100 U of erythropoietin per 10^ cells in 48 

5 hours as determined by radio immunoassay • 

43. Vertebrate cells according to claim 42 
capable of producing in excess of 500 U erythropoietin 
per 10^ cells in 43 hours, 

10 

44. Vertebrate cells according to claim 42 
capable of producing in excess of 1,000 U erythropoietin 
per 10^ cells in 43 hours. 

15 45. Vertebrate cells according to claim 42 

which are mammalian or avian cells. 

46. Vertebrate cells according to claim 45 
which are COS-1 cells or CHO cells. 

20 - 

47. A synthetic polypeptide having part or all 
of the amino acid sequence as set forth in Table V and 
having one or more of the in, ^^^^ iQ vitro biological 

■activities of naturally-occurring monkey erythropoietin. 

25 

48. A synthetic polypeptide having part or all 
of the amino acid sequence set forth in Table VI, other 
than a sequence of residues entirely within the sequence 
numbered 1 through 20, and having a biological property 

30 of naturally-occurring human erythropoietin. 

49. A synthetic polypeptide having part or all 
of the secondary conformation of part or all of the amino 
acid sequence set forth in Table VI, other than a 

35 sequence of residues entirely within the sequence num- 
bered 1 through 20, and having a biological property of 
naturally- occurring human erythropoietin. 
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50. A process for the production of a polypep- 
tide having part or all of the primary structural ccnfor- 
mation and one or more of the biological properties of 
naturally-occurring erythropoietin, said process compri- 

5 sing: 

growing, under suitable nutrient conditions, 
procaryotic or eucaryotic host cells transformed or 
transfected with a DNA vector according to claim 37, and 
isolating desired polypeptide products of the expression 
10 of DNA sequences in said vector. 

51. An antibody substance characterized by 
immunoreact iv ity with erythropoietin and with a synthetic 
polypeptide having a primary structural conformation 

15 substantially duplicative of a continuous sequence of 
amino acid residues extant in naturally-occurring 
erythropoietin except for any polypeptide comprising a 
sequence of amino acid residues entirely comphrended 
within sequence, 

20 A-P-P-R-L-I-C-O-S-R-V-L-E-R-Y-L-L-E-A-K. 

52. An antibody according to claim 51, which is 
a monoclonal antibody. 

25 53. An antibody according to claim 51, which is 

a polyclonal antibody. 

54. An antibody according to claim 51, which is 
immunoreact ive with erythropoietin and a synthetic poly- 
30 peptide having the sequence selected from the sequences: 
V-P-D-T-K-V-N-F-y-A-W-K-R-M-E-V-G, 
K-E-A-I-S-P-P-O-A-A-S-A-A, and 

V-Y-S-N-F-L-R-G-K-L-K-L-Y-T-G-E-A-C-R-T-G-O-R. 

35 
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55. A pharmaceutical composition comprising an 
effective amount of a polypeotide according to claims 1, 
16, 39, 40 or 41 and a pharmaceut ical 1 y acceptable 
diluent ,. ad juvant or carrier. 

56. A method for providing erythropoietin 
therapy to a mammal comprising administering an effective 
amount of a polypeptide according to claims 1, 16, 39, 40 
or 41 . 

57. A method according, to claim 56 wherein the 
therapy comprises enhancing hematocrit levels. 

58. A purified and isolated DNA sequence as set 
15 out in Table V or VI or a fragment thereof or the comple- 
mentary strand of such a sequence or fragment. 



10 



59. A polypeptide product of the expression of 
a DNA sequence according to claim 58 in a procaryotic or 
20 eucaryotic host cell. 



60. An improvement in the method for detection 
of a specific single stranded polynucleotide of unknown 
sequence in a heterogeneous cellular or viral sample 
25 including multiple single-stranded polynucleotides 
wherien: 

(a) a mixture of labelled single-stranded poly- 
nucleotide probes is prepared having uniformly varying 
sequences of bases, each of said probes being potentially 

30 specifically complementary to a sequence of bases which 
is putatively unique to the polynucleotide to be 
detected, 

(b) the sample is fixed to a solid substrate; 

(c) the substrate having the sample fixed 

35 thereto is treated to diminish further binding of poly- 
nucleotides thereto except by way of hybridization to 
polynucleotides in said sample, 
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(d) the treated substrate having the sample 
fixed thereto is transitorily contacted with said mixture 
of labelled probes uncer conditions facilitative of 
hybridization only between totally complementary poly- 
5 nucleotides, and, 

Ce) the specific polynucleotide is detected by 
monitoring for the presence of a hybridization reaction 
between it and a totally complementary probe within said 
mixture of labelled probes, as evidenced by the presence 
10 of a higher density of labelled material on the substrate 
at the locus, of the specific polynucleotide in comparison 
to a background density of labelled material resulting 
from non-specific binding of labelled probes to the 
substrate , 

15 said improvement comprising using in excess of 

32 mixed probes and performance of one or more of the. 
following : 

(1) employing a nylon-based paper as said solid 
substrate ; 

20 (2) treating with a protease in steo (c); 

C3) employing individual labelled probe con- 
centrations of approximately 0.025 picomoles; and 

(4) employing as one of the hybridization con- 
ditions in step (d) stringent temperatures approaching to 
25 with 4*C away from the lowest calculated Td of any of the 
probes employed. 
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